pith. sign in

arxiv: 2605.28986 · v1 · pith:CA7RADTAnew · submitted 2026-05-27 · 🪐 quant-ph

Comparing Classical Simulation and Sample-Based Learning of Quantum Systems: Learning the Hardness of Quantum Systems from Samples

Pith reviewed 2026-06-29 11:11 UTC · model grok-4.3

classification 🪐 quant-ph
keywords quantum simulationsample-based learningentanglementnon-stabilizernessgenerative modelshessian eigenvaluequantum hardnessneural networks
0
0 comments X

The pith

Within the regimes studied, sample-based learning difficulty for quantum states tracks their classical simulation hardness from entanglement and non-stabilizerness.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper examines whether the ease of learning quantum states from measurement samples matches how hard they are to simulate classically. It tests this by training one fixed neural network model on samples from states where entanglement or non-stabilizerness is increased in controlled ways. If the two measures align, then the way neural networks train could serve as a practical test for quantum computational hardness. A reader might care because it connects theoretical complexity ideas to observable training behavior in models applied to quantum data.

Core claim

The central claim is that for the regimes studied, classical learnability tracks known simulation complexity measures. Increasing the bond dimension of matrix product states or the number of T gates in Clifford-dominated circuits leads to larger largest Hessian eigenvalues at convergence and degraded reconstruction performance under Random Subspace Optimization for the energy-based generative model.

What carries the argument

The observed tracking between simulation cost parameters (bond dimension for entanglement, T-gate count for non-stabilizerness) and neural-network complexity probes (largest Hessian eigenvalue, Random Subspace Optimization).

If this is right

  • Neural-network training dynamics can provide an empirical probe of quantum computational hardness.
  • Learnability from samples serves as a proxy for simulation complexity when entanglement or non-stabilizerness is varied.
  • Both entanglement and non-stabilizerness increase learning difficulty in the same direction as they increase simulation cost.
  • Reconstruction performance degrades under constrained model capacity as either quantum resource grows.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The correlation might be tested on other controlled families beyond matrix product states and Clifford-plus-T circuits.
  • If the pattern holds, it could help identify quantum data sets that are practically hard for classical learning algorithms.
  • Training dynamics might offer a way to rank the effective hardness of quantum systems without running full classical simulations.

Load-bearing premise

The fixed energy-based generative model architecture together with the two chosen probes yield a representative measure of learnability that is not an artifact of this specific model class or of the particular controlled state families.

What would settle it

A counterexample family of quantum states where simulation cost is high yet the model learns the samples to high accuracy, or where simulation cost is low yet learning fails under the same training conditions.

Figures

Figures reproduced from arXiv: 2605.28986 by Fernando de Melo, Jo\~ao Pedro Del Rey, Ra\'ul O. Vallejos.

Figure 1
Figure 1. Figure 1: FIG. 1 [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: FIG. 2 [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 4
Figure 4. Figure 4: FIG. 4 [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗
Figure 3
Figure 3. Figure 3: FIG. 3 [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗
Figure 7
Figure 7. Figure 7: FIG. 7. Number of training epochs required for convergence [PITH_FULL_IMAGE:figures/full_fig_p007_7.png] view at source ↗
Figure 5
Figure 5. Figure 5: FIG. 5. Entanglement entropy [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: FIG. 6. Entanglement entropy [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗
Figure 8
Figure 8. Figure 8: FIG. 8. RSO reconstruction error [PITH_FULL_IMAGE:figures/full_fig_p008_8.png] view at source ↗
read the original abstract

We investigate the relationship between two distinct classical approaches to quantum systems: direct simulation from a classical description and sample-based learning from measurement data. While both tasks ultimately aim to reproduce Born-rule statistics, complexity-theoretic results suggest that simulability and learnability need not coincide in general. Here we study this relationship empirically using a fixed deep energy-based generative model trained on measurement samples from controlled families of quantum states. We independently tune two quantum resources associated with classical simulation cost: entanglement, through the bond dimension of random matrix product states, and non-stabilizerness, through the number of T gates in Clifford-dominated circuits. Learning difficulty is characterized using two probes of neural-network complexity: the largest Hessian eigenvalue at convergence and Random Subspace Optimization. For both quantum resources, increasing simulation hardness systematically correlates with sharper loss landscapes and degraded reconstruction performance under constrained capacity. Our results indicate that, within the regimes studied here, classical learnability tracks known simulation complexity measures, suggesting that neural-network training dynamics can provide an empirical probe of quantum computational hardness.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The paper empirically studies the link between classical simulation hardness and sample-based learnability of quantum states. Using a fixed deep energy-based generative model, it independently varies bond dimension in random matrix-product states (controlling entanglement) and T-gate count in Clifford+T circuits (controlling non-stabilizerness), then measures learning difficulty via the largest Hessian eigenvalue at convergence and performance under Random Subspace Optimization. The central finding is that increasing either simulation-cost parameter systematically correlates with sharper loss landscapes and degraded reconstruction under capacity constraints, suggesting neural-network training dynamics can empirically probe quantum computational hardness.

Significance. If the correlation holds beyond the specific model class, the work supplies a concrete empirical bridge between two classically distinct tasks (simulation from description versus learning from samples) that complexity theory suggests need not coincide. The controlled, independent variation of two hardness parameters is a methodological strength. However, the absence of architecture ablations or alternative learnability metrics leaves open whether the observed tracking is general or an artifact of the chosen energy-based model and probes.

major comments (2)
  1. [Abstract] Abstract (description of experimental setup): the central claim that 'classical learnability tracks known simulation complexity measures' rests on the assumption that the fixed deep energy-based generative model together with the Hessian-eigenvalue and Random-Subspace-Optimization probes yield a representative measure of learnability. No ablation across model architectures, training procedures, or alternative learnability metrics is described, so any systematic bias in how this network family encodes Born-rule statistics of the chosen ensembles would produce a spurious correlation.
  2. [Abstract] Abstract (reporting of results): the text states that 'increasing simulation hardness systematically correlates with sharper loss landscapes and degraded reconstruction' but supplies no statistical details, error bars, number of independent runs, or significance tests. Without these, the reported 'consistent trends in two learning probes' remain plausible yet unverified.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments. We address each major point below, indicating planned revisions where appropriate.

read point-by-point responses
  1. Referee: [Abstract] Abstract (description of experimental setup): the central claim that 'classical learnability tracks known simulation complexity measures' rests on the assumption that the fixed deep energy-based generative model together with the Hessian-eigenvalue and Random-Subspace-Optimization probes yield a representative measure of learnability. No ablation across model architectures, training procedures, or alternative learnability metrics is described, so any systematic bias in how this network family encodes Born-rule statistics of the chosen ensembles would produce a spurious correlation.

    Authors: Our experimental design deliberately fixes the learning architecture to isolate the effects of varying the two quantum hardness parameters (bond dimension and T-gate count) while holding all other factors constant. This controlled setup directly tests whether the chosen learnability probes track simulation cost. We agree that the absence of architecture ablations leaves open the question of generality, and we will add an explicit discussion of this scope limitation together with suggestions for future work in the revised manuscript. revision: partial

  2. Referee: [Abstract] Abstract (reporting of results): the text states that 'increasing simulation hardness systematically correlates with sharper loss landscapes and degraded reconstruction' but supplies no statistical details, error bars, number of independent runs, or significance tests. Without these, the reported 'consistent trends in two learning probes' remain plausible yet unverified.

    Authors: The main text and figures report results averaged over multiple independent runs with error bars; we will revise the abstract to include a concise statement of these statistical details (e.g., number of runs and consistency of trends). revision: yes

Circularity Check

0 steps flagged

No significant circularity: empirical correlation between independently measured quantities.

full rationale

The paper reports an empirical study in which simulation hardness is controlled via externally defined parameters (bond dimension of random MPS, T-gate count in Clifford+T circuits) while learnability is assessed via separate neural-network probes (largest Hessian eigenvalue at convergence, Random Subspace Optimization performance). No derivation, equation, or self-citation reduces one set of quantities to the other by construction or by fitting a parameter that is then renamed as a prediction. The central claim is an observed correlation within fixed model families, not a self-referential identity.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on standard domain assumptions from quantum information and machine learning; no new free parameters, axioms, or invented entities are introduced in the abstract.

axioms (2)
  • domain assumption The chosen energy-based generative model is capable of approximating the Born-rule statistics of the sampled quantum states under sufficient training.
    Implicit in the use of the model to reproduce measurement statistics.
  • domain assumption Largest Hessian eigenvalue at convergence and Random Subspace Optimization performance are valid quantitative proxies for learning difficulty.
    Used without further justification to characterize learning difficulty.

pith-pipeline@v0.9.1-grok · 5721 in / 1254 out tokens · 45385 ms · 2026-06-29T11:11:25.588664+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

73 extracted references · 20 canonical work pages · 13 internal anchors

  1. [1]

    A further limitation concerns the measurement basis

    – the smallestdthat recovers a target fraction of full- model performance – would provide a single number per dataset that aims to be consistent across architectures, and would allow a more direct comparison between the two resources. A further limitation concerns the measurement basis. Our model is trained exclusively on Born distributions in the computa...

  2. [2]

    M. J. Bremner, R. Jozsa, and D. J. Shepherd, Proceed- ings of the Royal Society A: Mathematical, Physical and Engineering Sciences467, 459 (2011)

  3. [3]

    Aaronson and A

    S. Aaronson and A. Arkhipov, inProceedings of the forty- third annual ACM symposium on Theory of computing (2011) pp. 333–342

  4. [4]

    Aaronson, Proceedings of the Royal Society A: Math- ematical, Physical and Engineering Sciences463, 3089 (2007)

    S. Aaronson, Proceedings of the Royal Society A: Math- ematical, Physical and Engineering Sciences463, 3089 (2007)

  5. [5]

    Hinsche, M

    M. Hinsche, M. Ioannou, A. Nietner, J. Haferkamp, Y. Quek, D. Hangleiter, J.-P. Seifert, J. Eisert, and R. Sweke, Physical Review Letters130, 240602 (2023), arXiv:2207.03140 [quant-ph]

  6. [6]

    Torlai, G

    G. Torlai, G. Mazzola, J. Carrasquilla, M. Troyer, R. Melko, and G. Carleo, Nature physics14, 447 (2018)

  7. [7]

    Carrasquilla and R

    J. Carrasquilla and R. G. Melko, Nature Physics13, 431 (2017)

  8. [8]

    Carrasquilla, Advances in Physics: X5, 1797528 (2020)

    J. Carrasquilla, Advances in Physics: X5, 1797528 (2020)

  9. [9]

    Y. Du, Y. Zhu, Y.-H. Zhang, M.-H. Hsieh, P. Rebentrost, W. Gao, Y.-D. Wu, J. Eisert, G. Chiribella, D. Tao,et al., arXiv preprint arXiv:2509.04923 (2025)

  10. [10]

    Bittel, A

    L. Bittel, A. A. Mele, J. Eisert, and L. Leone, Quantum 9, 1665 (2025)

  11. [11]

    Huang, R

    H.-Y. Huang, R. Kueng, G. Torlai, V. V. Albert, and J. Preskill, Science377, eabk3333 (2022)

  12. [12]

    A condition under which classical simulability implies efficient state learnability

    M. Yoganathan, A condition under which classical simulability implies efficient state learnability (2019), arXiv:1907.08163 [quant-ph]

  13. [13]

    Ippoliti and V

    M. Ippoliti and V. Khemani, PRX Quantum5, 020304 (2024)

  14. [14]

    H. Kim, A. Kumar, Y. Zhou, Y. Xu, R. Vasseur, and E.-A. Kim, arXiv preprint arXiv:2508.15895 (2025)

  15. [15]

    Kliesch and I

    M. Kliesch and I. Roth, PRX quantum2, 010201 (2021)

  16. [16]

    Hangleiter and J

    D. Hangleiter and J. Eisert, Reviews of Modern Physics 95, 035001 (2023)

  17. [17]

    H. T. Nguyen, P. Krishnan, D. Krishnaswamy, M. Us- man, and R. Buyya, arXiv preprint arXiv:2404.11420 (2024)

  18. [18]

    M. Y. Niu, A. M. Dai, L. Li, A. Odena, Z. Zhao, V. Smelyanskyi, H. Neven, and S. Boixo, arXiv preprint arXiv:2010.11983 (2020)

  19. [19]

    W. J. Maddox, G. Benton, and A. G. Wilson, arXiv preprint arXiv:2003.02139 (2020)

  20. [20]

    C. Li, H. Farkhoor, R. Liu, and J. Yosinski, arXiv preprint arXiv:1804.08838 (2018)

  21. [21]

    Ghorbani, S

    B. Ghorbani, S. Krishnan, and Y. Xiao, inInterna- tional Conference on Machine Learning(PMLR, 2019) pp. 2232–2241

  22. [22]

    Schollw¨ ock, Annals of physics326, 96 (2011)

    U. Schollw¨ ock, Annals of physics326, 96 (2011)

  23. [23]

    Bravyi and D

    S. Bravyi and D. Gosset, Physical review letters116, 250501 (2016)

  24. [24]

    Empirical Analysis of the Hessian of Over-Parametrized Neural Networks

    L. Sagun, U. Evci, V. U. Guney, Y. Dauphin, and L. Bot- tou, arXiv preprint arXiv:1706.04454 (2017)

  25. [25]
  26. [26]

    The Heisenberg Representation of Quantum Computers

    D. Gottesman, arXiv preprint quant-ph/9807006 (1998)

  27. [27]

    Aaronson and D

    S. Aaronson and D. Gottesman, Physical Review A—Atomic, Molecular, and Optical Physics70, 052328 (2004)

  28. [28]

    Bravyi and A

    S. Bravyi and A. Kitaev, Physical Review A—Atomic, Molecular, and Optical Physics71, 022316 (2005)

  29. [29]

    L. Chen, R. J. Garcia, K. Bu, and A. Jaffe, Physical Review B109, 174207 (2024)

  30. [30]

    Vidal, Physical review letters91, 147902 (2003)

    G. Vidal, Physical review letters91, 147902 (2003). 9

  31. [31]

    S. Zhou, Z. Yang, A. Hamma, and C. Chamon, SciPost Physics9, 087 (2020)

  32. [32]

    Iannotti, G

    D. Iannotti, G. Esposito, L. C. Venuti, and A. Hamma, Quantum9, 1797 (2025)

  33. [33]

    Gray, Journal of Open Source Software3, 819 (2018)

    J. Gray, Journal of Open Source Software3, 819 (2018)

  34. [34]

    Garnerone, T

    S. Garnerone, T. R. de Oliveira, and P. Zanardi, Physical Review A—Atomic, Molecular, and Optical Physics81, 032336 (2010)

  35. [35]

    Quantum computing with Qiskit

    A. Javadi-Abhari, M. Treinish, K. Krsulich, C. J. Wood, J. Lishman, J. Gacon, S. Martiel, P. D. Na- tion, L. S. Bishop, A. W. Cross,et al., arXiv preprint arXiv:2405.08810 (2024)

  36. [36]

    G. E. Hinton, Neural computation14, 1771 (2002)

  37. [37]

    N. S. Keskar, D. Mudigere, J. Nocedal, M. Smelyan- skiy, and P. T. P. Tang, arXiv preprint arXiv:1609.04836 (2016)

  38. [38]

    Sharpness-Aware Minimization for Efficiently Improving Generalization

    P. Foret, A. Kleiner, H. Mobahi, and B. Neyshabur, arXiv preprint arXiv:2010.01412 (2020)

  39. [39]

    A note on the evaluation of generative models

    L. Theis, A. v. d. Oord, and M. Bethge, arXiv preprint arXiv:1511.01844 (2015)

  40. [40]

    Eigenvalues of the Hessian in Deep Learning: Singularity and Beyond

    L. Sagun, L. Bottou, and Y. LeCun, arXiv preprint arXiv:1611.07476 (2016)

  41. [41]

    Gradient Descent Happens in a Tiny Subspace

    G. Gur-Ari, D. A. Roberts, and E. Dyer, arXiv preprint arXiv:1812.04754 (2018)

  42. [42]

    Fort and S

    S. Fort and S. Ganguli, arXiv preprint arXiv:1910.05929 (2019)

  43. [43]

    Chaudhari, A

    P. Chaudhari, A. Choromanska, S. Soatto, Y. Le- Cun, C. Baldassi, C. Borgs, J. Chayes, L. Sagun, and R. Zecchina, Journal of Statistical Mechanics: Theory and Experiment2019, 124018 (2019)

  44. [44]

    LeCun, J

    Y. LeCun, J. Denker, and S. Solla, Advances in neural information processing systems2(1989)

  45. [45]

    Frantar and D

    E. Frantar and D. Alistarh, inInternational conference on machine learning(PMLR, 2023) pp. 10323–10337

  46. [46]

    P. W. Koh and P. Liang, inInternational conference on machine learning(PMLR, 2017) pp. 1885–1894

  47. [47]

    Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

    A. Power, Y. Burda, H. Edwards, I. Babuschkin, and V. Misra, arXiv preprint arXiv:2201.02177 (2022)

  48. [48]

    T. Han, L. Adilova, H. Petzka, J. Kleesiek, and M. Kamp, Advances in Neural Information Processing Systems38, 106766 (2026)

  49. [49]

    Bengio, A

    Y. Bengio, A. Courville, and P. Vincent, IEEE transac- tions on pattern analysis and machine intelligence35, 1798 (2013)

  50. [50]

    E. J. Hu, Y. Shen, P. Wallis, Z. Allen-Zhu, Y. Li, S. Wang, L. Wang, W. Chen,et al., Iclr1, 3 (2022)

  51. [51]

    Dettmers, M

    T. Dettmers, M. Lewis, Y. Belkada, and L. Zettlemoyer, Advances in neural information processing systems35, 30318 (2022)

  52. [52]

    The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks

    J. Frankle and M. Carbin, arXiv preprint arXiv:1803.03635 (2018)

  53. [53]

    Barton, J

    B. Barton, J. Carrasquilla, C. Roth, and A. Valenti, Ma- chine Learning: Science and Technology (2026)

  54. [54]

    Raghu, B

    M. Raghu, B. Poole, J. Kleinberg, S. Ganguli, and J. Sohl-Dickstein, ininternational conference on machine learning(PMLR, 2017) pp. 2847–2854

  55. [55]

    Aghajanyan, S

    A. Aghajanyan, S. Gupta, and L. Zettlemoyer, inPro- ceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing (volume 1: long papers)(2021) pp. 7319–7328

  56. [56]

    M. A. Nielsen and I. L. Chuang,Quantum computation and quantum information(Cambridge university press, 2010)

  57. [57]

    T. M. Cover,Elements of information theory(John Wi- ley & Sons, 1999)

  58. [58]

    Veitch, S

    V. Veitch, S. H. Mousavian, D. Gottesman, and J. Emer- son, New Journal of Physics16, 013009 (2014)

  59. [59]

    Turkeshi, E

    X. Turkeshi, E. Tirrito, and P. Sierant, Nature Commu- nications16, 2575 (2025)

  60. [60]

    Leone, S

    L. Leone, S. F. Oliviero, Y. Zhou, and A. Hamma, Quan- tum5, 453 (2021)

  61. [61]

    Z. Yao, A. Gholami, S. Shen, M. Mustafa, K. Keutzer, and M. Mahoney, inproceedings of the AAAI conference on artificial intelligence, Vol. 35 (2021) pp. 10665–10673

  62. [62]

    Brunner, D

    N. Brunner, D. Cavalcanti, S. Pironio, V. Scarani, and S. Wehner, Reviews of modern physics86, 419 (2014)

  63. [63]

    R. Uola, A. C. Costa, H. C. Nguyen, and O. G¨ uhne, Reviews of Modern Physics92, 015001 (2020)

  64. [64]

    A. Bera, T. Das, D. Sadhukhan, S. Singha Roy, A. Sen, and U. Sen, Reports on Progress in Physics81, 024001 (2018)

  65. [65]

    Horodecki, P

    R. Horodecki, P. Horodecki, M. Horodecki, and K. Horodecki, Reviews of modern physics81, 865 (2009)

  66. [66]

    Facco, M

    E. Facco, M. d’Errico, A. Rodriguez, and A. Laio, Scien- tific reports7, 12140 (2017)

  67. [67]

    Weimar, L

    M. Weimar, L. M. Rachbauer, I. Starshynov, D. Faccio, L. Adilova, D. Bouchet, and S. Rotter, Physical Review X15, 031072 (2025)

  68. [68]

    Kamkari, B

    H. Kamkari, B. L. Ross, R. Hosseinzadeh, J. C. Cresswell, and G. Loaiza-Ganem, Advances in Neural Information Processing Systems37, 38307 (2024)

  69. [69]

    Tempczyk, R

    P. Tempczyk, R. Michaluk, L. Garncarek, P. Spurek, J. Tabor, and A. Golinski, inInternational Conference on Machine Learning(PMLR, 2022) pp. 21205–21231

  70. [70]

    Passetti, D

    G. Passetti, D. Hofmann, P. Neitemeier, L. Grunwald, M. A. Sentef, and D. M. Kennes, Physical Review Letters 131, 036502 (2023)

  71. [71]

    Denis, A

    Z. Denis, A. Sinibaldi, and G. Carleo, Physical Review Letters134, 079701 (2025)

  72. [72]

    Bukov, A

    M. Bukov, A. G. Day, D. Sels, P. Weinberg, A. Polkovnikov, and P. Mehta, Physical Review X8, 031086 (2018)

  73. [73]

    Valenti, E

    A. Valenti, E. van Nieuwenburg, S. Huber, and E. Gre- plova, Physical Review Research1, 033092 (2019)