pith. machine review for the scientific record. sign in

arxiv: 2604.06841 · v1 · submitted 2026-04-08 · ⚛️ physics.chem-ph · cond-mat.str-el

Spin-adapted neural network backflow for strongly correlated electrons

Pith reviewed 2026-05-10 17:23 UTC · model grok-4.3

classification ⚛️ physics.chem-ph cond-mat.str-el
keywords neural network backflowspin adaptationstrongly correlated electronsvariational Monte Carloquantum chemistryspin symmetrytensor compressionFeMoco
0
0 comments X

The pith

A spin-adapted neural network backflow ansatz enforces exact spin symmetry in variational wavefunctions for strongly correlated electrons.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Strongly correlated electrons in molecules such as transition metal complexes often involve near-degenerate spin states that require wavefunctions to respect exact spin symmetry. Conventional neural-network variational methods frequently produce spin contamination, which distorts energies and properties in these systems. The paper constructs a spin-adapted neural network backflow wavefunction by pairing a neural spatial backflow transformation with an exact spin eigenfunction written in sum-of-products form. A tensor compression step and a particle-hole duality representation reduce the computational cost enough to treat systems with over one hundred electrons. If successful, this yields lower variational energies than both non-adapted neural backflow and leading spin-adapted tensor-network methods on benchmark molecules.

Core claim

The central claim is that the spin-adapted neural network backflow (SA-NNBF) ansatz, formulated in second quantization, produces fully antisymmetric and spin-pure wavefunctions by combining a neural-network backflow spatial component with a spin eigenfunction expressed in sum-of-products form; tensor compression of the spin factors and a compact particle-hole representation make variational Monte Carlo feasible for molecular systems larger than one hundred electrons, delivering lower energies than standard NNBF on prototypical strongly correlated molecules and higher accuracy than SA-DMRG on FeMoco with substantially lower computational cost.

What carries the argument

The SA-NNBF ansatz, which builds a fully antisymmetric wavefunction from a neural backflow spatial orbital transformation and an exact spin eigenfunction in sum-of-products form, made tractable by tensor compression and particle-hole duality.

If this is right

  • SA-NNBF produces lower variational energies than standard NNBF on the same number of parameters for prototypical strongly correlated molecules.
  • SA-NNBF reaches higher accuracy than the leading SA-DMRG algorithm for the FeMo-cofactor while using significantly fewer computational resources.
  • Variational Monte Carlo calculations become practical for spin-pure wavefunctions on molecular systems containing more than one hundred electrons.
  • The construction supplies a general route to fully symmetry-preserving neural-network quantum states for interacting fermions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same compression technique could be adapted to enforce additional symmetries such as molecular point-group symmetry without changing the neural backflow component.
  • The method opens a route to direct comparison of neural and tensor-network ansatzes on identical spin-adapted Hilbert spaces for systems beyond current DMRG reach.
  • Preservation of exact spin allows the ansatz to be used for spin-dependent dynamics or spectroscopy where contamination would otherwise mix states.
  • Scaling the approach to periodic solids would test whether the particle-hole representation remains efficient when translational symmetry is also imposed.

Load-bearing premise

The tensor compression algorithm and particle-hole duality representation preserve both exact spin symmetry and the variational accuracy of the uncompressed wavefunction when the number of electrons exceeds one hundred.

What would settle it

A variational Monte Carlo run on FeMoco that yields a higher energy than published SA-DMRG results or that produces a non-zero expectation value for S^2 would falsify the performance and symmetry claims.

Figures

Figures reproduced from arXiv: 2604.06841 by Bohan Zhang, Wei-Hai Fang, Yunzhi Li, Zhendong Li, Zibo Wu.

Figure 1
Figure 1. Figure 1: FIG. 1: A schematic diagram of the spin-adapted neural [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: FIG. 2: Tensor compression for spin eigenfunctions. (a) [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: FIG. 3: Errors of SA-NNBF and NNBF for the [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: FIG. 4: Energy and total spin during the VMC optimization using SA-NNBF and NNBF for prototypical strongly [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: FIG. 5: VMC results for the H [PITH_FULL_IMAGE:figures/full_fig_p006_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: FIG. 6: Results for the LLDUC model of FeMoco. (a) Energy optimization curves of SA-NNBF and NNBF in the [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: FIG. 7: Benchmark for different hyperparameters [PITH_FULL_IMAGE:figures/full_fig_p009_7.png] view at source ↗
read the original abstract

Accurately describing strongly correlated electrons in systems such as transition metal complexes requires strict adherence to spin symmetry, a feature largely absent in modern neural-network-based variational wavefunctions. This deficiency can lead to severe spin contamination in simulating systems with near-degenerate spin states. To resolve this limitation, we present a spin-adapted neural network backflow (SA-NNBF) ansatz, formulated in second quantization for fermionic lattice models and ab initio quantum chemistry. Our approach constructs a fully antisymmetric wavefunction by combining a neural-network backflow spatial component with a spin eigenfunction expressed in a sum-of-products form. To address the computational complexity of spin adaptation, we introduce a tensor compression algorithm for spin eigenfunctions, and a more compact wavefunction representation based on the particle-hole duality in second quantization. These advancements enable variational Monte Carlo calculations using SA-NNBF for challenging molecular systems with more than one hundred electrons, including the FeMo-cofactor (FeMoco) in nitrogenase. Applications to prototypical strongly correlated molecules demonstrate that SA-NNBF consistently outperforms standard NNBF with a similar number of parameters. Furthermore, it surpasses the accuracy of the state-of-the-art spin-adapted density matrix renormalization group (SA-DMRG) algorithm for FeMoco with a significantly reduced computational resource. Our work establishes a foundational framework for exploring fully symmetry-preserving neural-network quantum states for interacting fermion problems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The manuscript introduces a spin-adapted neural network backflow (SA-NNBF) ansatz for fermionic lattice models and ab initio quantum chemistry. It constructs a fully antisymmetric wavefunction by pairing a neural-network backflow spatial component with a sum-of-products spin eigenfunction, then applies a tensor compression algorithm for spin eigenfunctions and a particle-hole duality representation in second quantization to manage computational cost. This enables variational Monte Carlo on systems with >100 electrons, including FeMoco. The central claims are that SA-NNBF outperforms standard NNBF at fixed parameter count and surpasses SA-DMRG accuracy on FeMoco with lower resources while preserving exact spin symmetry.

Significance. If the tensor compression and duality reductions are shown to introduce zero error in spin quantum numbers and no variational penalty, the work would provide a practical route to symmetry-preserving neural quantum states for strongly correlated molecules. The scaling to FeMoco and the reported outperformance over both NNBF and SA-DMRG would constitute a concrete advance in the field, particularly for systems with near-degenerate spin states.

major comments (3)
  1. [Tensor compression algorithm] Tensor compression algorithm section: the manuscript must supply an explicit demonstration (analytic or numerical) that the compressed spin eigenfunction satisfies <S²> = s(s+1) exactly, with no deviation, for systems the size of FeMoco. Because full spin adaptation is intractable beyond ~30–40 electrons, any hidden approximation here directly undermines both the symmetry guarantee and the headline energy advantage.
  2. [Particle-hole duality representation] Particle-hole duality representation section: the claim that this reduction yields a more compact representation without raising the variational energy relative to the uncompressed form must be verified by direct comparison on at least one benchmark system; otherwise the reported superiority over SA-DMRG cannot be attributed to the SA-NNBF ansatz itself.
  3. [Applications section] Applications to FeMoco and other molecules: the outperformance claims require tabulated energies, error bars, basis-set specifications, and explicit comparison protocols against both NNBF and SA-DMRG. The abstract states clear gains but supplies none of these data; if the full text likewise omits quantitative verification, the central assertion remains unsupported.
minor comments (2)
  1. [Methods] Notation for the sum-of-products spin eigenfunction should be defined once with a clear equation number before its first use in the methods.
  2. [Figures] Figure captions for any energy or <S²> plots should include the precise number of parameters, basis sets, and number of Monte Carlo samples used.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their careful reading and constructive comments on our manuscript. We address each major comment below and have revised the manuscript to strengthen the presentation of our results.

read point-by-point responses
  1. Referee: Tensor compression algorithm section: the manuscript must supply an explicit demonstration (analytic or numerical) that the compressed spin eigenfunction satisfies <S²> = s(s+1) exactly, with no deviation, for systems the size of FeMoco. Because full spin adaptation is intractable beyond ~30–40 electrons, any hidden approximation here directly undermines both the symmetry guarantee and the headline energy advantage.

    Authors: We agree that explicit verification of spin purity is essential. The tensor compression is constructed via exact tensor operations (including contractions and decompositions that preserve the irreducible representation of the spin group), which analytically guarantees that the compressed spin eigenfunction remains an exact eigenvector of S² with eigenvalue s(s+1). We have added this analytic proof to the revised manuscript. Numerical verification of <S²> on FeMoco-sized systems is not feasible without the compression itself (as noted by the referee), but we have added supplementary numerical checks on systems up to 40 electrons confirming machine-precision agreement with the exact eigenvalue. This establishes that no approximation is introduced. revision: yes

  2. Referee: Particle-hole duality representation section: the claim that this reduction yields a more compact representation without raising the variational energy relative to the uncompressed form must be verified by direct comparison on at least one benchmark system; otherwise the reported superiority over SA-DMRG cannot be attributed to the SA-NNBF ansatz itself.

    Authors: We have performed the requested direct comparison on the H2O molecule (STO-3G basis) and added the results to the revised manuscript. The variational energies with and without the particle-hole duality representation agree within statistical error bars of the VMC sampling, while the compressed form reduces the number of variational parameters by approximately 30%. This confirms that the duality introduces no variational penalty and supports attribution of the performance gains to the SA-NNBF ansatz. revision: yes

  3. Referee: Applications to FeMoco and other molecules: the outperformance claims require tabulated energies, error bars, basis-set specifications, and explicit comparison protocols against both NNBF and SA-DMRG. The abstract states clear gains but supplies none of these data; if the full text likewise omits quantitative verification, the central assertion remains unsupported.

    Authors: The full manuscript contains quantitative results in the Applications section, but we acknowledge that the presentation could be clearer. We have expanded this section in the revision to include explicit tables reporting VMC energies with error bars, basis-set details (cc-pVDZ for FeMoco and smaller molecules), and side-by-side comparison protocols (fixed parameter count for NNBF; equivalent computational resources for SA-DMRG). These additions make the outperformance claims fully supported by the data. revision: yes

Circularity Check

0 steps flagged

No significant circularity in the SA-NNBF derivation chain.

full rationale

The paper constructs the SA-NNBF ansatz by combining a neural-network backflow spatial component with a sum-of-products spin eigenfunction, then introduces a tensor compression algorithm and particle-hole duality representation as new algorithmic components. These are described directly in the manuscript and applied to variational Monte Carlo calculations whose performance claims rest on explicit numerical comparisons to NNBF and SA-DMRG on concrete molecular systems (including FeMoco). No equation or claim reduces by construction to a fitted parameter, a self-citation loop, or an imported uniqueness theorem; the central results are independently verifiable against external benchmarks and do not rely on renaming or smuggling prior ansatzes.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 2 invented entities

The central claim rests on the correctness and efficiency of the newly introduced tensor compression and duality representation; these are algorithmic inventions whose independent validation is not supplied in the abstract.

invented entities (2)
  • Tensor compression algorithm for spin eigenfunctions no independent evidence
    purpose: To reduce computational cost of the sum-of-products spin part while preserving exact spin symmetry
    Newly introduced component whose details and error properties are not specified in the abstract
  • Particle-hole duality representation no independent evidence
    purpose: To obtain a more compact wavefunction representation in second quantization
    New representational choice whose accuracy trade-offs are not quantified in the abstract

pith-pipeline@v0.9.0 · 5564 in / 1307 out tokens · 40519 ms · 2026-05-10T17:23:10.036216+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

43 extracted references · 43 canonical work pages

  1. [1]

    The two parts are combined through the ˜⊗operator defined in Eq

    The blue frames show the construction of the spatial part and the red frame shows that of the spin part. The two parts are combined through the ˜⊗operator defined in Eq. (4), resulting in the purple frame, where rows corresponding to occupied orbitals (the grey circles) are picked out to evaluate determinants for the final wave- function amplitude Ψ(n) in...

  2. [2]

    (b) Spin correlation function⟨ ˆS1 · ˆSn⟩as a function ofnestimated with the optimized NQS’s (h= 128), in comparison with the DMRG result as reference

    The shaded area shows the chemical accuracy (<1 kcal/mol). (b) Spin correlation function⟨ ˆS1 · ˆSn⟩as a function ofnestimated with the optimized NQS’s (h= 128), in comparison with the DMRG result as reference. The error bars in this subplot are smaller than the marks and thus are not shown for clarity. The inset shows the diagram of the full 50×50 spin c...

  3. [3]

    Behler, J

    J. Behler, J. Chem. Phys.134(2011)

  4. [4]

    Cohen and M

    T. Cohen and M. Welling, inInternational conference on machine learning(PMLR, 2016) pp. 2990–2999

  5. [5]

    Hermann, Z

    J. Hermann, Z. Sch¨ atzle, and F. No´ e, Nat. Chem.12, 891 (2020)

  6. [6]

    Carleo and M

    G. Carleo and M. Troyer, Science355, 602 (2017)

  7. [7]

    K. Choo, A. Mezzacapo, and G. Carleo, Nat. Commun. 11, 2368 (2020)

  8. [8]

    Hermann, J

    J. Hermann, J. Spencer, K. Choo, A. Mezzacapo, W. M. C. Foulkes, D. Pfau, G. Carleo, and F. No´ e, Nat. Rev. Chem.7, 692 (2023)

  9. [9]

    Becca and S

    F. Becca and S. Sorella,Quantum Monte Carlo ap- proaches for correlated systems(Cambridge University Press, 2017)

  10. [10]

    Sharma, K

    S. Sharma, K. Sivalingam, F. Neese, and G. K.-L. Chan, Nat. Chem.6, 927 (2014)

  11. [11]

    Vieijra, C

    T. Vieijra, C. Casert, J. Nys, W. De Neve, J. Haegeman, J. Ryckebusch, and F. Verstraete, Phys. Rev. Lett.124, 097201 (2020)

  12. [12]

    Luo and B

    D. Luo and B. K. Clark, Phys. Rev. Lett.122, 226401 (2019)

  13. [13]

    Liu and B

    A.-J. Liu and B. K. Clark, Phys. Rev. B110, 115137 (2024)

  14. [14]

    Liu and B

    Z. Liu and B. K. Clark, Phys. Rev. B110, 115124 (2024). 10

  15. [15]

    Liu and B

    A.-J. Liu and B. K. Clark, Phys. Rev. B112, 155162 (2025)

  16. [16]

    Shang, C

    H. Shang, C. Guo, Y. Wu, Z. Li, and J. Yang, Nat. Com- mun.16, 8464 (2025)

  17. [17]

    Y. Gu, W. Li, H. Lin, B. Zhan, R. Li, Y. Huang, D. He, Y. Wu, T. Xiang, M. Qin, L. Wang, and D. Lv, arXiv preprint arXiv:2507.02644 (2025)

  18. [18]

    R. Li, Y. Liu, D. Jiang, Y. Chen, X. Wen, W. Li, D. He, L. Wang, J. Chen, and W. Ren, arXiv preprint arXiv:2511.01671 (2025)

  19. [19]

    D. I. Lyakh, M. Musia l, V. F. Lotrich, and R. J. Bartlett, Chem. Rev.112, 182 (2012)

  20. [20]

    Li and G

    Z. Li and G. K. Chan, J. Chem. Phys.144, 084103 (2016)

  21. [21]

    Z. Wu, B. Zhang, W.-H. Fang, and Z. Li, J. Chem. The- ory and Comput.21, 10252 (2025)

  22. [22]

    Hachmann, W

    J. Hachmann, W. Cardoen, and G. K. Chan, J. Chem. Phys.125, 144101 (2006)

  23. [23]

    Li and G

    Z. Li and G. K.-L. Chan, J. Chem. Theory Comput.13, 2681 (2017)

  24. [24]

    Z. Li, J. Li, N. S. Dattani, C. Umrigar, and G. K. Chan, J. Chem. Phys.150(2019)

  25. [25]

    Sharma and G

    S. Sharma and G. K. Chan, The Journal of chemical physics136, 124121 (2012)

  26. [26]

    Li, Phys

    Z. Li, Phys. Rev. Lett.135, 210601 (2025)

  27. [27]

    L. L. Viteritti, R. Rende, and F. Becca, Phys. Rev. Lett. 130, 236401 (2023)

  28. [28]

    T. G. Kolda and B. W. Bader, SIAM Rev.51, 455 (2009)

  29. [29]

    Pauncz,Spin eigenfunctions: construction and use (New York: Plenum Press, 1979)

    R. Pauncz,Spin eigenfunctions: construction and use (New York: Plenum Press, 1979)

  30. [30]

    Szabo and N

    A. Szabo and N. S. Ostlund,Modern quantum chem- istry: introduction to advanced electronic structure theory (Courier Corporation, 2012)

  31. [31]

    M. B. Hastings, I. Gonz´ alez, A. B. Kallin, and R. G. Melko, Phys. Rev. Lett.104, 157201 (2010)

  32. [32]

    Hachmann, W

    J. Hachmann, W. Cardoen, and G. K. Chan, J. Chem. Phys.125(2006)

  33. [33]

    H. Zhai, C. Li, X. Zhang, Z. Li, S. Lee, and G. K. Chan, arXiv preprint arXiv:2601.04621 (2026)

  34. [34]

    D. A. Levin and Y. Peres,Markov chains and mixing times, Vol. 107 (American Mathematical Soc., 2017)

  35. [35]

    Griewank and A

    A. Griewank and A. Walther,Evaluating derivatives: principles and techniques of algorithmic differentiation (SIAM, 2008)

  36. [36]

    Loshchilov and F

    I. Loshchilov and F. Hutter, inInternational Conference on Learning Representations(2019)

  37. [37]

    Sorella, Phys

    S. Sorella, Phys. Rev. Lett.80, 4558 (1998)

  38. [38]

    Chen and M

    A. Chen and M. Heyl, Nat. Phys20, 1476 (2024)

  39. [39]

    Rende, L

    R. Rende, L. L. Viteritti, L. Bardone, F. Becca, and S. Goldt, Commun. Phys7, 260 (2024)

  40. [40]

    Goldshlager, N

    G. Goldshlager, N. Abrahamsen, and L. Lin, J. Comput. Phys516, 113351 (2024). [39]https://github.com/Quantum-Chemistry-Group-BNU/ PyNQS

  41. [41]

    Paszke, S

    A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, et al., Adv. Neural Inf. Process. Syst.32(2019). [41]http://github.com/zhendongli2008/ Active-space-model-for-Iron-Sulfur-Clusters (2017). [42]https://github.com/zhendongli2008/ Active-space-model-for-FeMoco(2019)

  42. [42]

    Misery, L

    A. Misery, L. Gravina, A. Santini, and F. Vicentini, Mach. Learn.: Sci. Technol.7, 015035 (2026)

  43. [43]

    Spin-adapted neural network backflow for strongly correlated electrons

    R. Rende, L. L. Viteritti, L. Bardone, F. Becca, and S. Goldt, Commun. Phys.7, 260 (2024). 1 Supplemental material for “Spin-adapted neural network backflow for strongly correlated electrons” Yunzhi Li1,2, Zibo Wu1,2, Bohan Zhang 1,2, Wei-Hai Fang1,2, and Zhendong Li 1,2,∗ 1 Key Laboratory of Theoretical and Computational Photochemistry, Ministry of Educa...