Projected Inverse Iteration: An Eigenvalue Approach to Ground-State Computation with Neural Quantum States
Pith reviewed 2026-06-27 21:29 UTC · model grok-4.3
The pith
Reframing neural quantum state optimization as an eigenvalue problem yields gap-insensitive convergence at polynomial cost.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Projected Inverse Iteration reframes variational ground-state search for neural quantum states as an eigenvalue problem, applying inverse iteration in a projected manner that decouples convergence speed from spectral gap size while preserving the polynomial scaling of stochastic reconfiguration.
What carries the argument
Projected Inverse Iteration (PII), which solves the ground-state eigenvalue problem by inverting the Hamiltonian and projecting the resulting updates onto the tangent space of the neural network wavefunction manifold.
If this is right
- Enables reliable optimization on frustrated systems with small gaps such as the J1-J2 model.
- Maintains the same per-iteration cost scaling as stochastic reconfiguration.
- Opens a route to treat other deep-learning eigenvalue tasks as natural-gradient problems.
Where Pith is reading between the lines
- The method could be tested on three-dimensional or larger lattices to check whether the gap independence persists at scale.
- Similar projected inverse steps might accelerate other variational optimizations that reduce to eigenvalue searches.
- Applications to strongly correlated electron models could be explored by combining PII with existing neural architectures.
Load-bearing premise
That the eigenvalue reformulation can be carried out for neural wavefunctions without creating new computational bottlenecks or sacrificing the polynomial scaling of stochastic reconfiguration.
What would settle it
A controlled numerical experiment on a tunable-gap model where PII convergence time increases as the gap is deliberately reduced.
read the original abstract
Deep learning offers a powerful approach to quantum many-body problems via neural network wavefunctions, but their optimization remains a severe bottleneck. Existing optimization methods, including natural gradient descent and stochastic reconfiguration, suffer from spectral gap-dependent convergence that limits their effectiveness on systems fraught with competing orders and nearly degenerate ground states, such as frustrated magnets and strongly correlated electron materials. Here, we introduce Projected Inverse Iteration (PII) by re-framing the ground-state search as an eigenvalue problem. PII achieves rapid, gap-insensitive convergence while preserving the favorable polynomial computational scaling of stochastic reconfiguration. Demonstrated on challenging two-dimensional spin systems, including the highly frustrated $J_1$-$J_2$ model, PII outperforms standard optimization techniques and presents a promising algorithmic strategy for discovering complex quantum states in the presence of small spectral gaps. More broadly, PII can be interpreted as a novel natural gradient method tailored for eigenvalue problems, opening up its application to related challenges within deep learning.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces Projected Inverse Iteration (PII) by recasting ground-state search for neural quantum states as an eigenvalue problem. It claims that PII delivers rapid, gap-insensitive convergence while retaining the polynomial computational scaling of stochastic reconfiguration, and demonstrates outperformance versus standard optimizers on two-dimensional frustrated spin models including the J1-J2 Heisenberg model.
Significance. If the gap-insensitivity and scaling claims hold, the work would address a central practical limitation of natural-gradient and SR methods on systems with small gaps, such as frustrated magnets. The framing as a natural-gradient method specialized to eigenvalue problems could also extend to other variational optimization settings.
major comments (1)
- [Algorithm description and complexity analysis] The central claim that PII preserves the favorable polynomial scaling of SR while achieving gap-insensitive convergence is load-bearing, yet the manuscript provides no explicit operation-count analysis or pseudocode showing that the manifold projection and approximate inverse iteration steps incur only a constant-factor overhead relative to a single SR iteration (independent of gap size). This directly engages the weakest assumption identified in the stress-test note.
Simulated Author's Rebuttal
We thank the referee for their careful reading of the manuscript and for identifying the need for a more explicit complexity analysis. We address the major comment below.
read point-by-point responses
-
Referee: [Algorithm description and complexity analysis] The central claim that PII preserves the favorable polynomial scaling of SR while achieving gap-insensitive convergence is load-bearing, yet the manuscript provides no explicit operation-count analysis or pseudocode showing that the manifold projection and approximate inverse iteration steps incur only a constant-factor overhead relative to a single SR iteration (independent of gap size). This directly engages the weakest assumption identified in the stress-test note.
Authors: We agree that the manuscript would benefit from an explicit operation-count analysis and pseudocode. The PII procedure augments a standard SR step with a manifold projection (via QR or SVD on the parameter update) and an approximate inverse iteration (solving a small linear system whose size is set by the number of variational parameters, independent of the spectral gap). These steps add only a constant-factor overhead whose leading term is O(N_p^3) for N_p parameters, identical in scaling to the dominant SR matrix inversion. In the revised manuscript we will insert a new subsection containing (i) pseudocode for the full PII iteration and (ii) a detailed flop-count table confirming that the gap-independent overhead remains O(1) relative to one SR iteration. revision: yes
Circularity Check
No significant circularity in derivation chain
full rationale
The paper introduces Projected Inverse Iteration as a reframing of ground-state search into an eigenvalue problem for NQS, asserting gap-insensitive convergence and preserved polynomial scaling of SR. No quoted equations or steps reduce the central claims to self-definitional loops, fitted inputs renamed as predictions, or load-bearing self-citations. The method is presented as a novel algorithmic construction whose properties are demonstrated on models rather than derived tautologically from prior results or parameters. The scaling assertion is a design claim, not a reduction by construction. This is a self-contained algorithmic proposal.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
On the momentum term in gradient descent learning algorithms.Neu- ral Networks12, 145–151 (1999)
Qian, N. On the momentum term in gradient descent learning algorithms.Neu- ral Networks12, 145–151 (1999). URL https://www.sciencedirect.com/science/ article/pii/S0893608098001166
1999
-
[2]
Kingma, D. P. & Ba, J. Bengio, Y. & LeCun, Y. (eds)Adam: A method for stochastic optimization. (eds Bengio, Y. & LeCun, Y.)International Conference on Learning Representations (ICLR)(2015). URL https://arxiv.org/abs/1412. 6980. 31
2015
-
[3]
& Grosse, R
Martens, J. & Grosse, R. Bach, F. & Blei, D. (eds)Optimizing neural net- works with kronecker-factored approximate curvature. (eds Bach, F. & Blei, D.) Proceedings of the 32nd International Conference on Machine Learning, Vol. 37 ofProceedings of Machine Learning Research, 2408–2417 (PMLR, Lille, France, 2015). URL https://proceedings.mlr.press/v37/martens15.html
2015
-
[4]
& Singer, Y
Gupta, V., Koren, T. & Singer, Y. Dy, J. & Krause, A. (eds)Shampoo: Preconditioned stochastic tensor optimization. (eds Dy, J. & Krause, A.)Pro- ceedings of the 35th International Conference on Machine Learning, Vol. 80 of Proceedings of Machine Learning Research, 1842–1850 (PMLR, 2018). URL https://proceedings.mlr.press/v80/gupta18a.html
2018
-
[5]
Vyas, N.et al.Yue, Y., Garg, A., Peng, N., Sha, F. & Yu, R. (eds)Soap: Improving and stabilizing shampoo using adam for language modeling. (eds Yue, Y., Garg, A., Peng, N., Sha, F. & Yu, R.)International Conference on Learning Represen- tations, Vol. 2025, 93423–93444 (2025). URL https://proceedings.iclr.cc/paper files/paper/2025/file/e988664070e9591f93fd...
2025
-
[6]
Carleo, G. & Troyer, M. Solving the quantum many-body problem with artificial neural networks.Science355, 602–606 (2017). URL https://www.science.org/ doi/abs/10.1126/science.aag2302
-
[7]
Science354(6317), 1240–1241 (2016) https://doi.org/10.1126/science
Wu, D.et al.Variational benchmarks for quantum many-body problems.Science 386, 296–301 (2024). URL https://www.science.org/doi/abs/10.1126/science. adg9774
-
[8]
Astrakhantsev, N.et al.Broken-symmetry ground states of the heisenberg model on the pyrochlore lattice.Phys. Rev. X11, 041021 (2021). URL https://link. aps.org/doi/10.1103/PhysRevX.11.041021
-
[9]
Hibat-Allah, M., Ganahl, M., Hayward, L. E., Melko, R. G. & Carrasquilla, J. Recurrent neural network wave functions.Phys. Rev. Res.2, 023358 (2020). URL https://link.aps.org/doi/10.1103/PhysRevResearch.2.023358
-
[10]
S., Wiersema, R., Hibat-Allah, M., Carrasquilla, J
Moss, M. S., Wiersema, R., Hibat-Allah, M., Carrasquilla, J. & Melko, R. G. Leveraging recurrence in neural network wavefunctions for large-scale simulations of heisenberg antiferromagnets on the square lattice.Phys. Rev. B112, 134450 (2025). URL https://link.aps.org/doi/10.1103/6ccd-wzhz
-
[11]
Pescia, G., Nys, J., Kim, J., Lovato, A. & Carleo, G. Message-passing neural quantum states for the homogeneous electron gas.Phys. Rev. B110, 035108 (2024). URL https://link.aps.org/doi/10.1103/PhysRevB.110.035108
-
[12]
Viteritti, L. L., Rende, R. & Becca, F. Transformer variational wave functions for frustrated quantum spin systems.Phys. Rev. Lett.130, 236401 (2023). URL https://link.aps.org/doi/10.1103/PhysRevLett.130.236401. 32
-
[13]
& Heyl, M
Chen, A. & Heyl, M. Empowering deep neural quantum states through efficient optimization.Nature Physics20, 1476–1481 (2024). URL https://doi.org/10. 1038/s41567-024-02566-1
2024
-
[14]
& Carleo, G
Denis, Z. & Carleo, G. Accurate neural quantum states for interacting lattice bosons.Quantum9, 1772 (2025). URL https://doi.org/10.22331/ q-2025-06-17-1772
2025
-
[15]
Linteau, D., Pescia, G., Nys, J., Carleo, G. & Holzmann, M. Phase diagram and crystal melting of helium-4 in two dimensions.Phys. Rev. Lett.134, 246001 (2025). URL https://link.aps.org/doi/10.1103/v1g7-m9k4
-
[16]
S., Matthews, A
Pfau, D., Spencer, J. S., Matthews, A. G. D. G. & Foulkes, W. M. C. Ab initio solution of the many-electron schr¨ odinger equation with deep neural net- works.Phys. Rev. Res.2, 033429 (2020). URL https://link.aps.org/doi/10.1103/ PhysRevResearch.2.033429
2020
-
[17]
Hermann, J., Sch¨ atzle, Z. & No´ e, F. Deep-neural-network solution of the elec- tronic schr¨ odinger equation.Nature Chemistry12, 891–897 (2020). URL https://doi.org/10.1038/s41557-020-0544-y
-
[18]
URL https://arxiv.org/abs/2506.19960
Foster, A.et al.An ab initio foundation model of wavefunctions that accurately describes chemical bond breaking (2025). URL https://arxiv.org/abs/2506.19960. arXiv:2506.19960
arXiv 2025
-
[19]
von Glehn, I., Spencer, J. S. & Pfau, D. A self-attention ansatz for ab-initio quan- tum chemistry (2023). URL https://arxiv.org/abs/2211.13672. arXiv:2211.13672
arXiv 2023
-
[20]
Scherbela, M., Gerard, L. & Grohs, P. Towards a transferable fermionic neural wavefunction for molecules.Nature Communications15, 120 (2024). URL https: //doi.org/10.1038/s41467-023-44216-9
-
[21]
URL https://doi.org/10.1038/s42256-024-00794-x
Li, R.et al.A computational framework for neural network-based variational monte carlo with forward laplacian.Nature Machine Intelligence6, 209–219 (2024). URL https://doi.org/10.1038/s42256-024-00794-x
-
[22]
R., Carleo, G., Georges, A
Moreno, J. R., Carleo, G., Georges, A. & Stokes, J. Fermionic wave functions from neural-network constrained hidden states.Proceedings of the National Academy of Sciences119, e2122059119 (2022). URL https://www.pnas.org/doi/abs/10. 1073/pnas.2122059119
2022
-
[23]
Communications Physics7, 148 (2024)
Kim, J.et al.Neural-network quantum states for ultra-cold fermi gases. Communications Physics7, 148 (2024). URL https://doi.org/10.1038/ s42005-024-01613-w
2024
-
[24]
T.et al.Neural wave functions for superfluids.Phys
Lou, W. T.et al.Neural wave functions for superfluids.Phys. Rev. X14, 021030 (2024). URL https://link.aps.org/doi/10.1103/PhysRevX.14.021030. 33
-
[25]
Pfau, D., Axelrod, S., Sutterud, H., von Glehn, I. & Spencer, J. S. Accurate com- putation of quantum excited states with neural networks.Science385, eadn0137 (2024). URL https://www.science.org/doi/abs/10.1126/science.adn0137
-
[26]
Sorella, S. Green function monte carlo with stochastic reconfiguration.Phys. Rev. Lett.80, 4558–4561 (1998). URL https://link.aps.org/doi/10.1103/PhysRevLett. 80.4558
-
[27]
L., Bardone, L., Becca, F
Rende, R., Viteritti, L. L., Bardone, L., Becca, F. & Goldt, S. A sim- ple linear algebra identity to optimize large-scale neural network quantum states.Communications Physics7, 260 (2024). URL https://doi.org/10.1038/ s42005-024-01732-4
2024
-
[28]
(eds Oh, A
Neklyudov, K.et al.Oh, A.et al.(eds)Wasserstein quantum monte carlo: A novel approach for solving the quantum many-body schr¨ odinger equation. (eds Oh, A. et al.)Advances in Neural Information Processing Systems, Vol. 36, 63461–63482 (Curran Associates, Inc., 2023). URL https://proceedings.neurips.cc/paper files/ paper/2023/file/c8450235f227f136242f774b2...
2023
-
[29]
Webber, R. J. & Lindsey, M. Rayleigh-gauss-newton optimization with enhanced sampling for variational monte carlo.Phys. Rev. Res.4, 033099 (2022). URL https://link.aps.org/doi/10.1103/PhysRevResearch.4.033099
-
[30]
& Lin, L
Goldshlager, G., Abrahamsen, N. & Lin, L. A kaczmarz-inspired approach to accelerate the optimization of neural network wavefunctions.Journal of Com- putational Physics516, 113351 (2024). URL https://www.sciencedirect.com/ science/article/pii/S0021999124005990
2024
-
[31]
Drissi, M., Keeble, J. W. T., Rozal´ en Sarmiento, J. & Rios, A. Second-order optimization strategies for neural network quantum states.Philosophical Trans- actions of the Royal Society A: Mathematical, Physical and Engineering Sciences 382, 20240057 (2024). URL https://doi.org/10.1098/rsta.2024.0057
- [32]
-
[33]
URL https://arxiv.org/abs/2508.02570
Jiang, D.et al.Neural scaling laws surpass chemical accuracy for the many- electron schr¨ odinger equation (2025). URL https://arxiv.org/abs/2508.02570. arXiv:2508.02570
arXiv 2025
-
[34]
URL https://arxiv.org/abs/2507.02644
Gu, Y.et al.Solving the hubbard model with neural quantum states (2025). URL https://arxiv.org/abs/2507.02644. arXiv:2507.02644
arXiv 2025
-
[35]
& Zeinhofer, M
M¨ uller, J. & Zeinhofer, M. Salakhutdinov, R.et al.(eds)Position: Optimization in SciML should employ the function space geometry. (eds Salakhutdinov, R.et al.) Proceedings of the 41st International Conference on Machine Learning, Vol. 235 34 ofProceedings of Machine Learning Research, 36705–36722 (PMLR, 2024). URL https://proceedings.mlr.press/v235/mull...
2024
-
[36]
Trefethen, L. N. & Bau, D.Numerical Linear Algebra(SIAM, 2022)
2022
-
[37]
Jiang, H.-C., Yao, H. & Balents, L. Spin liquid ground state of the spin- 1 2 square J1-J2 heisenberg model.Phys. Rev. B86, 024424 (2012). URL https://link.aps. org/doi/10.1103/PhysRevB.86.024424
-
[38]
N., Motrunich, O
Gong, S.-S., Zhu, W., Sheng, D. N., Motrunich, O. I. & Fisher, M. P. A. Plaquette ordered phase and quantum phase diagram in the spin-1 2 J1−J2 square heisenberg model.Phys. Rev. Lett.113, 027201 (2014). URL https://link.aps.org/doi/10. 1103/PhysRevLett.113.027201
2014
-
[39]
& Becca, F
Ferrari, F. & Becca, F. Gapless spin liquid and valence-bond solid in theJ 1-J2 heisenberg model on the square lattice: Insights from singlet and triplet excita- tions.Phys. Rev. B102, 014417 (2020). URL https://link.aps.org/doi/10.1103/ PhysRevB.102.014417
2020
-
[40]
Richter, J., Zinke, R. & Farnell, D. J. J. The spin-1/2 square-lattice j1-j2 model: the spin-gap issue.The European Physical Journal B88, 2 (2015). URL https: //doi.org/10.1140/epjb/e2014-50589-x
-
[41]
Boumal, N.An introduction to optimization on smooth manifolds(Cambridge University Press, 2023)
2023
-
[42]
Stokes, J., Izaac, J., Killoran, N. & Carleo, G. Quantum Natural Gradient. Quantum4, 269 (2020). URL https://doi.org/10.22331/q-2020-05-25-269
-
[43]
Toulouse, J. & Umrigar, C. J. Optimization of quantum monte carlo wave func- tions by energy minimization.The Journal of Chemical Physics126, 084102 (2007). URL https://doi.org/10.1063/1.2437215
-
[44]
URL http://github.com/jax-ml/jax
Bradbury, J.et al.JAX: composable transformations of Python+NumPy programs (2018). URL http://github.com/jax-ml/jax
2018
- [45]
- [46]
-
[47]
Absil, P.-A., Mahony, R. & Sepulchre, R.Optimization Algorithms on Matrix Manifolds(Princeton University Press, Princeton, 2009). URL https://doi.org/ 10.1515/9781400830244. 35
-
[48]
& Sepulchre, R.Optimization algorithms on matrix manifolds(Princeton University Press, 2008)
Absil, P.-A., Mahony, R. & Sepulchre, R.Optimization algorithms on matrix manifolds(Princeton University Press, 2008)
2008
-
[49]
Codebases7 (2022)
Vicentini, F.et al.NetKet 3: Machine Learning Toolbox for Many-Body Quantum Systems.SciPost Phys. Codebases7 (2022). URL https://scipost.org/10.21468/ SciPostPhysCodeb.7
2022
-
[50]
URL https://www.sciencedirect.com/ science/article/pii/S2352711019300974
Carleo, G.et al.Netket: A machine learning toolkit for many-body quantum systems.SoftwareX10, 100311 (2019). URL https://www.sciencedirect.com/ science/article/pii/S2352711019300974. 36
2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.