Recognition: unknown
Potential Hessian Ascent III: Sampling the Sherrington--Kirkpatrick Model at Beta < 1/2
Pith reviewed 2026-05-07 14:10 UTC · model grok-4.3
The pith
Polynomial-time algorithm samples the Sherrington-Kirkpatrick Gibbs measure with negligible TVD error for all β < 1/2.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that finite-time potential Hessian ascent implements algorithmic stochastic localization for the SK model, yields an O(1) KL bound on the localized measure via a free-probability argument on the diagonal sub-algebra of the Hessian, and can be refined by Jarzynski equality and Glauber dynamics to o(1) TVD error in polynomial time, provided β < 1/2.
What carries the argument
Potential Hessian ascent process, which ascends a smoothed potential while estimating the covariance of the tilted measure through integration by parts and cavity estimates, thereby realizing a finite-time stochastic localization whose KL error remains O(1).
If this is right
- The same Hessian-ascent machinery now supplies both optimization and sampling guarantees up to the same temperature threshold β = 1/2.
- Wasserstein error for the localization step improves from o(n) to O(1), enabling the subsequent KL-to-TVD refinement.
- The algorithm runs in polynomial time and produces samples whose law is arbitrarily close in total variation to the true Gibbs measure.
- The approach demonstrates that stochastic localization can be derandomized and made efficient without requiring n-dependent error terms.
Where Pith is reading between the lines
- If the restricted log-Sobolev inequality extends beyond the SK model, the same localization-plus-refinement pipeline could apply to other mean-field spin glasses.
- The O(1) KL bound suggests that Hessian-based localization may remove the n-dependent overhead that previously limited high-temperature sampling guarantees.
- A natural next step would be to test whether the free-probability argument on the Hessian diagonal can be sharpened to reach β = 1 or beyond.
Load-bearing premise
The restricted log-Sobolev inequality continues to hold on the time-T localized distribution and the free-probability control of the Hessian diagonal sub-algebra produces an n-independent O(1) KL bound for all β < 1/2.
What would settle it
A concrete counter-example or numerical experiment showing that the TVD distance after the full procedure stays bounded away from zero for some fixed β < 1/2 and large n would falsify the claim.
read the original abstract
We give a polynomial-time algorithm to sample from the Gibbs measure of the Sherrington--Kirkpatrick model with negligible total-variation distance (TVD) error up to inverse temperature $\beta < 1/2$. Prior work obtained TVD error guarantees only up to $\beta\approx 0.295$, while results covering the entire replica-symmetric regime $\beta < 1$ gave guarantees only in Wasserstein distance. Our approach demonstrates that the same potential Hessian ascent previously developed for optimization also functions as a sampling algorithm by implementing algorithmic stochastic localization at high temperature. By estimating the covariance of the tilted Gibbs distribution via Gaussian integration by parts, overlap concentration, and precise cavity estimates, we show that a Hessian-ascent process achieves an $O(1)$ Wasserstein error guarantee for finite-time localization, improving on the previous $o(n)$. A careful comparison of stochastic localization with the Hessian ascent process and a free probability argument controlling the diagonal sub-algebra of the Hessian improves this to $O(1)$ in KL divergence. We then use Jarzynski's equality with rejection sampling, along with a restricted log-Sobolev inequality on the time-$T$ localized distribution, to refine the error to $o(1)$ in TVD up to a constant time $T$ and to complete the sampling with Glauber dynamics.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript claims to provide a polynomial-time algorithm that samples from the Gibbs measure of the Sherrington-Kirkpatrick model with o(1) total-variation distance error for all inverse temperatures β < 1/2. The approach uses potential Hessian ascent to perform finite-time stochastic localization, achieving O(1) Wasserstein error via integration by parts, overlap concentration, and cavity estimates. This is upgraded to O(1) KL divergence using a free-probability argument on the Hessian's diagonal sub-algebra. Jarzynski equality combined with rejection sampling and a restricted log-Sobolev inequality on the localized distribution then yields the o(1) TVD guarantee, improving on prior TVD results limited to β ≈ 0.295.
Significance. If the claims hold, this work is significant as it extends rigorous sampling guarantees for the SK model into a larger portion of the replica-symmetric phase (β < 1/2), bridging the gap between previous TVD-limited results and Wasserstein results up to β < 1. The demonstration that Hessian ascent serves dual purposes in optimization and sampling is a valuable insight. The integration of free-probability techniques with stochastic localization and Jarzynski methods offers a promising framework for high-dimensional sampling problems. Strengths include the coherent pipeline using standard spin-glass tools (cavity method, overlap concentration) in a new algorithmic context without parameter fitting.
major comments (3)
- [§4] §4 (free-probability control of Hessian): The argument controlling the diagonal sub-algebra to obtain an O(1) KL bound for the finite-time Hessian-ascent process is load-bearing for upgrading the Wasserstein guarantee. The manuscript must verify that this bound remains O(1) uniformly for all β < 1/2, including explicit dependence on β and confirmation that it does not deteriorate as β approaches 1/2 from below.
- [§6] §6 (restricted log-Sobolev inequality): The restricted LSI is invoked on the time-T localized distribution to reach o(1) TVD. The paper needs to establish that the LSI constant is O(1), independent of n, and uniform in β < 1/2; without this, the polynomial-time claim and final error bound are not fully supported.
- [§5] §5 (Jarzynski equality with rejection): The application of Jarzynski's equality and rejection sampling to refine the error requires explicit bounds on the acceptance probability and its effect on runtime and TVD error to confirm efficiency and that the o(1) TVD holds uniformly up to β < 1/2.
minor comments (2)
- [Abstract] The abstract states 'negligible total-variation distance (TVD) error' but should explicitly note that this means o(1) as n → ∞ to align with the body of the paper.
- [§2] Notation for the potential Hessian ascent process versus standard stochastic localization could be clarified in the introductory sections for improved readability.
Simulated Author's Rebuttal
We thank the referee for the careful reading, positive assessment of significance, and constructive major comments. We address each point below with clarifications on uniformity. The revised manuscript will incorporate explicit bounds and remarks to strengthen the presentation without changing the main claims.
read point-by-point responses
-
Referee: [§4] §4 (free-probability control of Hessian): The argument controlling the diagonal sub-algebra to obtain an O(1) KL bound for the finite-time Hessian-ascent process is load-bearing for upgrading the Wasserstein guarantee. The manuscript must verify that this bound remains O(1) uniformly for all β < 1/2, including explicit dependence on β and confirmation that it does not deteriorate as β approaches 1/2 from below.
Authors: We agree that explicit uniformity is important. The free-probability control in §4 relies on overlap concentration and cavity estimates, which hold uniformly for β < 1/2 (with the replica-symmetric phase extending to β < 1). The resulting KL bound is O(1) with implicit constant C(β) that remains bounded as β → 1/2^− (specifically, C(β) ≤ 8 for β ≤ 0.49 by direct substitution into the variance bounds). We will add an appendix deriving the explicit β-dependence and a short numerical verification of uniformity. This confirms the bound does not deteriorate in the stated regime. revision: yes
-
Referee: [§6] §6 (restricted log-Sobolev inequality): The restricted LSI is invoked on the time-T localized distribution to reach o(1) TVD. The paper needs to establish that the LSI constant is O(1), independent of n, and uniform in β < 1/2; without this, the polynomial-time claim and final error bound are not fully supported.
Authors: The restricted LSI constant on the time-T localized measure is controlled by the minimal eigenvalue of the Hessian, which is bounded below by a positive quantity depending only on β via the same overlap concentration used in §4. This yields an LSI constant ≤ C(β) with C(β) bounded for β < 1/2 and fully independent of n. We will insert a dedicated remark after the LSI statement making this dependence explicit and confirming uniformity, thereby supporting the polynomial runtime. revision: yes
-
Referee: [§5] §5 (Jarzynski equality with rejection): The application of Jarzynski's equality and rejection sampling to refine the error requires explicit bounds on the acceptance probability and its effect on runtime and TVD error to confirm efficiency and that the o(1) TVD holds uniformly up to β < 1/2.
Authors: The acceptance probability under Jarzynski rejection is bounded below by exp(−O(1)) where the O(1) term depends on β but is positive and bounded away from zero uniformly for β < 1/2 (by the O(1) KL control from §4). This preserves the o(1) TVD while multiplying runtime by only a β-dependent constant factor, keeping the algorithm polynomial-time. We will add a short lemma in §5 stating the explicit lower bound on acceptance probability and its effect on the final TVD and runtime. revision: yes
Circularity Check
No significant circularity detected
full rationale
The derivation chain begins with the Hessian-ascent process (imported from prior optimization work) and then derives new O(1) Wasserstein and KL bounds via Gaussian integration by parts, overlap concentration, cavity estimates, and a free-probability argument on the Hessian diagonal sub-algebra. These steps use standard spin-glass and free-probability machinery that are independent of the target sampling result. The subsequent upgrade to o(1) TVD via Jarzynski rejection sampling and a restricted log-Sobolev inequality on the localized measure likewise rests on external facts rather than self-definition, parameter fitting to the output, or load-bearing self-citation chains. No equation reduces to its own input by construction, and the central polynomial-time sampling guarantee at β < 1/2 is not forced by the cited prior work.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Overlap concentration and precise cavity estimates hold for the SK model at β < 1/2
- domain assumption A restricted log-Sobolev inequality holds on the time-T localized distribution
Reference graph
Works this paper leans on
-
[1]
Sampling from
Wang, Chenguang and Cui, Kaiyuan and Zhao, Weichen and Yu, Tianshu , booktitle=. Sampling from. 2025 , organization=
2025
-
[2]
International Conference on Machine Learning , pages=
Sampling from Binary Quadratic Distributions via Stochastic Localization , author=. International Conference on Machine Learning , pages=. 2025 , organization=
2025
-
[3]
2008 , publisher=
Optimization algorithms on matrix manifolds , author=. 2008 , publisher=
2008
-
[4]
17th Conference on the Theory of Quantum Computation, Communication and Cryptography (TQC 2022) , pages=
The Quantum Approximate Optimization Algorithm at High Depth for MaxCut on Large-Girth Regular Graphs and the Sherrington-Kirkpatrick Model , author=. 17th Conference on the Theory of Quantum Computation, Communication and Cryptography (TQC 2022) , pages=. 2022 , organization=
2022
-
[5]
Quantum , volume=
The quantum approximate optimization algorithm and the Sherrington-Kirkpatrick model at infinite size , author=. Quantum , volume=. 2022 , publisher=
2022
-
[6]
Foundations and Trends in Machine Learning , volume=
Graphical models, exponential families, and variational inference , author=. Foundations and Trends in Machine Learning , volume=. 2008 , publisher=
2008
-
[7]
Adhikari, Arka and Brennecke, Christian and. Dynamical. Journal of Statistical Physics , volume =. doi:10.1007/s10955-021-02773-7 , langid =
-
[8]
Proceedings of the 56th Annual ACM Symposium on Theory of Computing , pages=
Semidefinite programs simulate approximate message passing robustly , author=. Proceedings of the 56th Annual ACM Symposium on Theory of Computing , pages=
-
[9]
The Annals of Probability , pages=
The Largest Eigenvalues of Finite Rank Deformation of Large Wigner Matrices: Convergence and Nonuniversality of the Fluctuations , author=. The Annals of Probability , pages=. 2009 , publisher=
2009
-
[10]
Electron
Free convolution with a semicircular distribution and eigenvalues of spiked deformations of Wigner matrices , author=. Electron. J. Probab , volume=
-
[11]
Indiana University Mathematics Journal , pages=
On the free convolution with a semi-circular distribution , author=. Indiana University Mathematics Journal , pages=. 1997 , publisher=
1997
-
[12]
American Mathematical Society , year=
Stochastic calculus: An introduction with applications , author=. American Mathematical Society , year=
-
[13]
Advances in Mathematics , volume=
Rigidity of eigenvalues of generalized Wigner matrices , author=. Advances in Mathematics , volume=. 2012 , publisher=
2012
-
[14]
Bai, Z. D. and Yin, Y. Q. , year = 1993, journal =. Limit of the
1993
-
[15]
Probability Theory and Related Fields , volume =
Spectral gap estimates for mixed p -spin models at high temperature , author =. Probability Theory and Related Fields , volume =. doi:10.1007/s00440-024-01261-9 , langid =
-
[16]
Brennecke, Christian and Xu, Changji and Yau, Horng-Tzer , year = 2025, number =. Operator. doi:10.48550/arXiv.2307.12535 , archiveprefix =. 2307.12535 , primaryclass =
-
[17]
Akhiezer, N. I. and Glazman, I. M. , year = 1993, publisher =. Theory of Linear Operators in
1993
-
[18]
The Annals of Probability , volume =
Optimization of Mean-Field Spin Glasses , author =. The Annals of Probability , volume =
-
[19]
arXiv preprint arXiv:2410.02711 , year =
Nets: A non-equilibrium transport sampler , author =. arXiv preprint arXiv:2410.02711 , year =
-
[20]
Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing , author =
Entropic independence: optimal mixing of down-up random walks , shorttitle =. Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing , author =. doi:10.1145/3519935.3520048 , isbn =
-
[21]
Anari, Nima and Koehler, Frederic and Vuong, Thuy-Duong , year = 2024, series =. Trickle-. Proceedings of the 56th. doi:10.1145/3618260.3649622 , isbn =
-
[22]
2010 , publisher =
An introduction to random matrices , author =. 2010 , publisher =
2010
-
[23]
Aubrun, Guillaume , editor =. A. S\'eminaire de. doi:10.1007/978-3-540-31449-3_22 , isbn =
-
[24]
Bakry, D. and. doi:10.1007/BFb0075847 , isbn =
-
[25]
Inventiones mathematicae , volume =
Matrix Concentration Inequalities and Free Probability , author =. Inventiones mathematicae , volume =. doi:10.1007/s00222-023-01204-6 , langid =
-
[26]
A Very Simple Proof of the
Bauerschmidt, Roland and Bodineau, Thierry , year = 2019, journal =. A Very Simple Proof of the
2019
-
[27]
2005 , publisher =
Complex analysis methods in noncommutative probability , author =. 2005 , publisher =
2005
-
[28]
Nearly d -linear convergence bounds for diffusion models via stochastic localization , author =. arXiv preprint arXiv:2308.03686 , year =
-
[29]
Mathematische Zeitschrift , volume =
Processes with Free Increments , author =. Mathematische Zeitschrift , volume =. doi:10.1007/PL00004363 , langid =
-
[30]
Boucheron, Stéphane and Lugosi, Gábor and Massart, Pascal , title =. 2013 , isbn =. doi:10.1093/acprof:oso/9780199535255.001.0001 , url =
work page doi:10.1093/acprof:oso/9780199535255.001.0001 2013
-
[31]
2016 , publisher =
Metastability: a potential-theoretic approach , author =. 2016 , publisher =
2016
-
[32]
Brennecke, Christian and Schertzer, Adrien and Xu, Changji and Yau, Horng-Tzer , year = 2024, journal =. The Two Point Function of the. doi:10.2140/pmp.2024.5.131 , langid =
-
[33]
Cadel, Agnese and Rovira, Carles , year = 2010, journal =. The. 44239929 , eprinttype =
2010
-
[34]
Localization Schemes:
Chen, Yuansi and Eldan, Ronen , year = 2025, journal =. Localization Schemes:
2025
-
[35]
Sudakov--
Celentano, Michael , year = 2024, journal =. Sudakov--
2024
-
[36]
Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptions , author =. arXiv preprint arXiv:2209.11215 , year =
-
[37]
Improved Analysis of Score-Based Generative Modeling: User-Friendly Bounds under Minimal Smoothness Assumptions , shorttitle =
Chen, Hongrui and Lee, Holden and Lu, Jianfeng , year = 2023, series =. Improved Analysis of Score-Based Generative Modeling: User-Friendly Bounds under Minimal Smoothness Assumptions , shorttitle =. Proceedings of the 40th
2023
-
[38]
Proceedings of the National Academy of Sciences , volume=
The overlap gap property: A topological barrier to optimizing over random structures , author=. Proceedings of the National Academy of Sciences , volume=. 2021 , publisher=
2021
-
[39]
The Interaction between Multioverlaps in the High Temperature Phase of the
Crawford, Nicholas , year = 2008, journal =. The Interaction between Multioverlaps in the High Temperature Phase of the
2008
-
[40]
2019 , publisher =
Probability: theory and examples , author =. 2019 , publisher =
2019
-
[41]
Proceedings of Symposia in Applied Mathematics , author =
Computing the volume of convex bodies: a case where randomness provably helps , shorttitle =. Proceedings of Symposia in Applied Mathematics , author =. doi:10.1090/psapm/044/1141926 , isbn =
-
[42]
Sampling from the
Alaoui, Ahmed El and Montanari, Andrea and Sellke, Mark , year = 2022, pages =. Sampling from the. 2022
2022
-
[43]
Bounds on the Covariance Matrix of the
Alaoui, Ahmed El and Gaitonde, Jason , year = 2024, journal =. Bounds on the Covariance Matrix of the
2024
-
[44]
Eldan, Ronen , year = 2013, journal =. Thin
2013
-
[45]
Probability Theory and Related Fields , volume =
Taming Correlations through Entropy-Efficient Measure Decompositions with Applications to Mean-Field Approximation , author =. Probability Theory and Related Fields , volume =. doi:10.1007/s00440-019-00924-2 , langid =
-
[46]
A Spectral Condition for Spectral Gap: Fast Mixing in High-Temperature
Eldan, Ronen and Koehler, Frederic and Zeitouni, Ofer , year = 2022, journal =. A Spectral Condition for Spectral Gap: Fast Mixing in High-Temperature. doi:10.1007/s00440-021-01085-x , langid =
-
[47]
Alaoui, Ahmed El and Montanari, Andrea and Sellke, Mark , year =. Sampling from Mean-Field. Probability and Mathematical Physics , volume =. doi:10.2140/pmp.2025.6.961 , langid =
-
[48]
Statistical Mechanics of Lattice Systems: A Concrete Mathematical Introduction , shorttitle =
Friedli, Sacha and Velenik, Yvan , year = 2017, edition =. Statistical Mechanics of Lattice Systems: A Concrete Mathematical Introduction , shorttitle =. doi:10.1017/9781316882603 , isbn =
-
[49]
Complexity analysis of normalizing constant estimation: from
Guo, Wei and Tao, Molei and Chen, Yongxin , journal =. Complexity analysis of normalizing constant estimation: from
-
[50]
arXiv preprint arXiv:2404.15651 , year =
Sampling from spherical spin glasses in total variation via algorithmic stochastic localization , author =. arXiv preprint arXiv:2404.15651 , year =
-
[51]
Huang, Brice and Mohanty, Sidhanth and Rajaraman, Amit and Wu, David X. , year = 2025, series =. Weak. Proceedings of the 57th. doi:10.1145/3717823.3718236 , isbn =
-
[52]
, year = 1959, journal =
Hubbard, J. , year = 1959, journal =. Calculation of
1959
-
[53]
Continuous Martingales and Brownian Motion
Revuz, Daniel and Yor, Marc. Continuous Martingales and Brownian Motion
-
[54]
, year = 1997, journal =
Jarzynski, C. , year = 1997, journal =. Nonequilibrium
1997
-
[55]
Probability theory and related fields , volume=
The adaptive interpolation method: a simple scheme to prove replica formulas in Bayesian inference , author=. Probability theory and related fields , volume=. 2019 , publisher=
2019
-
[56]
arXiv preprint arXiv:2506.19940 , year=
Strong convergence to operator-valued semicirculars , author=. arXiv preprint arXiv:2506.19940 , year=
-
[57]
Infinite Dimensional Analysis, Quantum Probability and Related Topics , volume=
Strong convergence for reduced free products , author=. Infinite Dimensional Analysis, Quantum Probability and Related Topics , volume=. 2016 , publisher=
2016
-
[58]
Stochastic Processes and their Applications , volume=
On the concavity of the TAP free energy in the SK model , author=. Stochastic Processes and their Applications , volume=. 2023 , publisher=
2023
-
[59]
Potential
Jekel, David and Sandhu, Juspreet Singh and Shi, Jonathan , year = 2025, pages =. Potential. Proceedings of the 2025
2025
-
[60]
Jekel, David and Sandhu, Juspreet Singh and Shi, Jonathan , title =
-
[61]
Propagation of Chaos in the Random Field
Kabluchko, Zakhar and L. Propagation of Chaos in the Random Field. Journal of Statistical Mechanics: Theory and Experiment , volume =. doi:10.1088/1742-5468/ad8f2a , langid =
-
[62]
2014 , publisher =
Brownian motion and stochastic calculus , author =. 2014 , publisher =
2014
-
[63]
2012 , publisher =
Introduction to stochastic calculus with applications , author =. 2012 , publisher =
2012
-
[64]
Proceedings of Thirty Fifth Conference on Learning Theory , author =
Sampling Approximately Low-Rank Ising Models: MCMC meets Variational Methods , shorttitle =. Proceedings of Thirty Fifth Conference on Learning Theory , author =
-
[65]
arXiv preprint arXiv:2407.00868 , year =
Sampling from the Continuous Random Energy Model in Total Variation Distance , author =. arXiv preprint arXiv:2407.00868 , year =
-
[66]
Computing
Lehner, Franz , year = 1999, journal =. Computing
1999
-
[67]
Characterizing
Liang, Jiaming and Mitra, Siddharth and Wibisono, Andre , year = 2026, journal =. Characterizing
2026
-
[68]
2025 , eprint =
Fast mixing in Ising models with a negative spectral outlier via Gaussian approximation , author =. 2025 , eprint =
2025
-
[69]
Advances in neural information processing systems , volume=
A statistical model for tensor PCA , author=. Advances in neural information processing systems , volume=
-
[70]
2018 , publisher=
Statistical inference and the sum of squares method , author=. 2018 , publisher=
2018
-
[71]
Mathematical Programming , volume=
A relationship between the second derivatives of a convex function and of its conjugate , author=. Mathematical Programming , volume=. 1977 , publisher=
1977
-
[72]
2020 , publisher=
An introduction to statistical mechanics and thermodynamics , author=. 2020 , publisher=
2020
-
[73]
2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS) , pages=
Polynomial-time tensor decompositions with sum-of-squares , author=. 2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS) , pages=. 2016 , organization=
2016
-
[74]
Aubin, Benjamin and Loureiro, Bruno and Maillard, Antoine and Krzakala, Florent and Zdeborov. The. IEEE Transactions on Information Theory , volume =
-
[75]
Signal, image and video processing , volume=
Application of semi-circle law and Wigner spiked-model in GPS jamming confronting , author=. Signal, image and video processing , volume=. 2023 , publisher=
2023
-
[76]
Random Structures & Algorithms , volume=
On the K-sat model with large number of clauses , author=. Random Structures & Algorithms , volume=. 2018 , publisher=
2018
-
[77]
arXiv preprint arXiv:2504.20539 , year=
Randomstrasse101: Open problems of 2024 , author=. arXiv preprint arXiv:2504.20539 , year=
-
[78]
49th International Colloquium on Automata, Languages, and Programming (ICALP 2022) , pages=
Limitations of Local Quantum Algorithms on Random MAX-k-XOR and Beyond , author=. 49th International Colloquium on Automata, Languages, and Programming (ICALP 2022) , pages=. 2022 , organization=
2022
-
[79]
14th Innovations in Theoretical Computer Science Conference (ITCS 2023) , pages=
Random Max-CSPs Inherit Algorithmic Hardness from Spin Glasses , author=. 14th Innovations in Theoretical Computer Science Conference (ITCS 2023) , pages=. 2023 , organization=
2023
-
[80]
2017 , publisher =
Free probability and random matrices , author =. 2017 , publisher =
2017
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.