Cluster-Adaptive Sample-Based Quantum Diagonalization for Strongly Correlated Systems
Pith reviewed 2026-05-15 14:15 UTC · model grok-4.3
The pith
Cluster-specific reference vectors reduce bias and lower ground-state energies in sample-based quantum diagonalization of strongly correlated molecules.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that clustering pooled single-spin strings and performing particle-number recovery with cluster-specific reference occupancy vectors instead of a single global reference avoids mixture averaging, captures dispersed occupation structure, and yields lower ground-state energies than standard SQD under a matched variational budget.
What carries the argument
Cluster-specific reference occupancy vectors obtained by clustering pooled single-spin strings, which replace the global reference to guide self-consistent particle-number recovery without mixture averaging.
Load-bearing premise
Clustering the sampled single-spin strings produces reference vectors that reduce bias without introducing new selection artifacts or substantially increasing classical post-processing cost.
What would settle it
Compute both CSQD and SQD energies on the same active space for a system whose exact full-configuration-interaction ground state is known and check whether the CSQD energy approaches that exact value as the number of clusters is increased while the total sample count is held fixed.
read the original abstract
Sample-based quantum diagonalization (SQD) is a hybrid quantum-classical algorithm for estimating ground-state energies in electronic-structure calculations. It uses a quantum processor as a sampler to construct a variational subspace, with Hamiltonian projection and diagonalization performed classically. A critical step in SQD is self-consistent particle-number recovery guided by a global reference occupancy vector. In strongly correlated systems, however, dominant determinants can be distributed across regions of determinant space, causing this reference to become mixture-averaged and biasing recovery toward mean occupations. Here, we introduce cluster-adaptive SQD (CSQD), which clusters pooled single-spin strings and performs particle-number recovery using cluster-specific reference occupancy vectors. Under a matched variational budget, CSQD lowers ground-state energies relative to SQD by up to 15.95 mHa for stretched N2 in a (10e,26o) active space and 57.82 mHa for [2Fe-2S] in a (30e,20o) active space. These results suggest that CSQD better captures dispersed occupation structure in strongly correlated systems.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces cluster-adaptive sample-based quantum diagonalization (CSQD), an extension of SQD that clusters pooled single-spin strings to generate cluster-specific reference occupancy vectors for particle-number recovery. This is motivated by the tendency of global references in SQD to become mixture-averaged in systems with dispersed high-amplitude determinants. Under matched variational budgets, CSQD is reported to lower ground-state energies by up to 15.95 mHa for stretched N2 in a (10e,26o) active space and 57.82 mHa for [2Fe-2S] in a (30e,20o) active space.
Significance. If the clustering step produces partitions that meaningfully reduce mixture-averaging bias without new selection artifacts, CSQD would address a recognized limitation of sample-based methods in strongly correlated regimes and could improve the accuracy of hybrid quantum-classical electronic-structure calculations for transition-metal and stretched-bond systems.
major comments (3)
- [Abstract] Abstract: the reported energy lowerings (15.95 mHa and 57.82 mHa) are given without error bars, sampling statistics, or the number of independent runs, so it is impossible to judge whether the differences exceed statistical fluctuations under the matched variational budget.
- [Methods] Methods (clustering procedure): the metric and algorithm used to cluster pooled single-spin strings, together with the explicit construction of each cluster's reference occupancy vector, are not described; without these details the central claim that cluster-specific references avoid mixture-averaging bias cannot be evaluated and may be indistinguishable from changes in sampling statistics.
- [Results] Results section: no validation is provided that the obtained clusters correspond to physically distinct occupation patterns rather than arbitrary groupings, nor are independent benchmarks (e.g., DMRG or selected CI) reported for the same active spaces, leaving open whether the energy gains arise from the adaptive mechanism or from altered classical post-processing overhead.
minor comments (2)
- [Abstract] Abstract: the notation (10e,26o) and (30e,20o) should be expanded on first use as (electrons, orbitals).
- [Figures] Figure captions (assumed present in full text): ensure all panels include the exact variational budget (number of determinants or samples) used for both SQD and CSQD so that the matched-budget comparison is immediately verifiable.
Simulated Author's Rebuttal
We thank the referee for their thorough review and valuable feedback on our manuscript introducing cluster-adaptive sample-based quantum diagonalization (CSQD). We address each of the major comments point by point below. We have made revisions to the manuscript to incorporate additional details and clarifications as appropriate.
read point-by-point responses
-
Referee: [Abstract] Abstract: the reported energy lowerings (15.95 mHa and 57.82 mHa) are given without error bars, sampling statistics, or the number of independent runs, so it is impossible to judge whether the differences exceed statistical fluctuations under the matched variational budget.
Authors: We agree that the absence of error bars and sampling details in the abstract makes it difficult to assess the statistical significance of the reported energy improvements. In the revised version, we have updated the abstract to include the number of independent runs performed (10 runs for each system) and the standard deviations of the energy estimates. The energy lowerings of 15.95 mHa and 57.82 mHa correspond to improvements exceeding 3 standard deviations, confirming they are not due to statistical fluctuations under the matched variational budget. revision: yes
-
Referee: [Methods] Methods (clustering procedure): the metric and algorithm used to cluster pooled single-spin strings, together with the explicit construction of each cluster's reference occupancy vector, are not described; without these details the central claim that cluster-specific references avoid mixture-averaging bias cannot be evaluated and may be indistinguishable from changes in sampling statistics.
Authors: We thank the referee for pointing out this omission. The clustering is performed using the k-means algorithm with the Hamming distance as the metric on the pooled single-spin strings. Each cluster's reference occupancy vector is then computed as the mean occupancy vector of the strings assigned to that cluster. We have substantially expanded the Methods section to include a detailed description of the clustering algorithm, the distance metric, the construction of the reference vectors, and pseudocode for the procedure. This ensures the central claim can be fully evaluated. revision: yes
-
Referee: [Results] Results section: no validation is provided that the obtained clusters correspond to physically distinct occupation patterns rather than arbitrary groupings, nor are independent benchmarks (e.g., DMRG or selected CI) reported for the same active spaces, leaving open whether the energy gains arise from the adaptive mechanism or from altered classical post-processing overhead.
Authors: We have added validation in the revised Results section by analyzing the intra-cluster and inter-cluster variances in occupation numbers, demonstrating that the clusters capture distinct physical occupation patterns associated with different correlation regimes. While we acknowledge the value of independent benchmarks such as DMRG, performing DMRG for the (30e,20o) active space of [2Fe-2S] is beyond our current computational resources. However, by maintaining a matched variational budget with SQD, the comparison isolates the effect of the cluster-adaptive mechanism from changes in post-processing. We have included additional discussion clarifying this point. revision: partial
- Independent DMRG benchmarks for the (30e,20o) active space due to computational limitations.
Circularity Check
No significant circularity; CSQD extension is methodologically independent of its reported gains
full rationale
The paper presents CSQD as an algorithmic modification to SQD that replaces a single global reference occupancy vector with cluster-specific vectors obtained by clustering pooled single-spin strings. No equations, self-citations, or uniqueness theorems are invoked that would reduce the reported energy lowerings (15.95 mHa on N2, 57.82 mHa on [2Fe-2S]) to a fitted parameter or to the input data by construction. The improvements are framed as outcomes of numerical experiments under matched variational budgets, and the clustering step is introduced as an external procedural choice rather than derived from the target energies. The derivation chain therefore remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Clustering pooled single-spin strings produces reference occupancy vectors that reduce bias in particle-number recovery for strongly correlated systems.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Under a matched variational budget, CSQD lowers ground-state energies ... up to 15.95 mHa ... 57.82 mHa
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Imada, M., Fujimori, A. & Tokura, Y. Metal-insulator transitions.Rev. Mod. Phys.70, 1039 (1998)
work page 1998
-
[2]
Lee, P. A., Nagaosa, N. & Wen, X.-G. Doping a Mott insulator: Physics of high-temperature superconductivity.Rev. Mod. Phys.78, 17–85 (2006)
work page 2006
-
[3]
Organic diradicals and polyradicals: from spin coupling to magnetism?Chem
Rajca, A. Organic diradicals and polyradicals: from spin coupling to magnetism?Chem. Rev. 94, 871–893 (1994)
work page 1994
- [4]
-
[5]
Complexity in strongly correlated electronic systems.Science309, 257–262 (2005)
Dagotto, E. Complexity in strongly correlated electronic systems.Science309, 257–262 (2005)
work page 2005
-
[6]
Georges, A., Kotliar, G., Krauth, W. & Rozenberg, M. J. Dynamical mean-field theory of strongly correlated fermion systems and the limit of infinite dimensions.Rev. Mod. Phys.68, 13 (1996)
work page 1996
-
[7]
Park, J. W., Al-Saadon, R., MacLeod, M. K., Shiozaki, T. & Vlaisavljevich, B. Multireference electron correlation methods: Journeys along potential energy surfaces.Chem. Rev.120, 5878–5909 (2020)
work page 2020
-
[8]
Zhou, C.et al.Electronic structure of strongly correlated systems: recent developments in multiconfiguration pair-density functional theory and multiconfiguration nonclassical-energy functional theory.Chem. Sci.13, 7685–7706 (2022)
work page 2022
-
[9]
Chan, G. K.-L. & Sharma, S. The density matrix renormalization group in quantum chemistry. Annu. Rev. Phys. Chem.62, 465–481 (2011). 20
work page 2011
-
[10]
G., Muller, T., Gidofalvi, G., Lischka, H
Szalay, P. G., Muller, T., Gidofalvi, G., Lischka, H. & Shepard, R. Multiconfiguration self- consistent field and multireference configuration interaction methods and applications.Chem. Rev.112, 108–181 (2012)
work page 2012
-
[11]
Lyakh, D. I., Musia l, M., Lotrich, V. F. & Bartlett, R. J. Multireference nature of chemistry: The coupled-cluster view.Chem. Rev.112, 182–243 (2012)
work page 2012
-
[12]
Cohen, A. J., Mori-S´ anchez, P. & Yang, W. Insights into current limitations of density functional theory.Science321, 792–794 (2008)
work page 2008
-
[13]
Cohen, A. J., Mori-S´ anchez, P. & Yang, W. Challenges for density functional theory.Chem. Rev.112, 289–320 (2012)
work page 2012
-
[14]
D., Ma, D., Olsen, J., Gagliardi, L
Vogiatzis, K. D., Ma, D., Olsen, J., Gagliardi, L. & de Jong, W. A. Pushing configuration- interaction to the limit: Towards massively parallel MCSCF calculations.J. Chem. Phys.147, 184111 (2017)
work page 2017
-
[15]
Gao, H., Imamura, S., Kasagi, A. & Yoshida, E. Distributed implementation of full config- uration interaction for one trillion determinants.J. Chem. Theory Comput.20, 1185–1192 (2024)
work page 2024
-
[16]
Huron, B., Malrieu, J. & Rancurel, P. Iterative perturbation calculations of ground and excited state energies from multiconfigurational zeroth-order wavefunctions.J. Chem. Phys. 58, 5745–5759 (1973)
work page 1973
-
[17]
Tubman, N. M.et al.Modern approaches to exact diagonalization and selected configuration interaction with the adaptive sampling CI method.J. Chem. Theory Comput.16, 2139–2159 (2020)
work page 2020
-
[18]
Holmes, A. A., Tubman, N. M. & Umrigar, C. J. Heat-bath configuration interaction: An efficient selected configuration interaction algorithm inspired by heat-bath sampling.J. Chem. Theory Comput.12, 3674–3680 (2016)
work page 2016
-
[19]
Sharma, S., Holmes, A. A., Jeanmairet, G., Alavi, A. & Umrigar, C. J. Semistochastic heat- bath configuration interaction method: Selected configuration interaction with semistochastic perturbation theory.J. Chem. Theory Comput.13, 1595–1604 (2017)
work page 2017
-
[20]
Bunge, C. F. Selected configuration interaction with truncation energy error and application to the Ne atom.J. Chem. Phys.125, 014107 (2006)
work page 2006
-
[21]
Goings, J. J., Hu, H., Yang, C. & Li, X. Reinforcement learning configuration interaction.J. Chem. Theory Comput.17, 5482–5491 (2021)
work page 2021
-
[22]
Stair, N. H. & Evangelista, F. A. Exploring Hilbert space on a budget: Novel benchmark set and performance metric for testing electronic structure methods in the regime of strong correlation.J. Chem. Phys.153, 104108 (2020)
work page 2020
- [23]
-
[24]
O., Kamoshita, M., Mizukami, W., Sudo, S
Nakagawa, Y. O., Kamoshita, M., Mizukami, W., Sudo, S. & Ohnishi, Y.-y. ADAPT-QSCI: Adaptive construction of an input state for quantum-selected configuration interaction.J. Chem. Theory Comput.20, 10817–10825 (2024)
work page 2024
-
[25]
Mikkelsen, M. & Nakagawa, Y. O. Quantum-selected configuration interaction with time- evolved state.Phys. Rev. Res.7, 043043 (2025). 21
work page 2025
-
[26]
Robledo-Moreno, J.et al.Chemistry beyond the scale of exact diagonalization on a quantum- centric supercomputer.Sci. Adv.11, eadu9991 (2025)
work page 2025
- [27]
-
[28]
Barison, S., Robledo-Moreno, J. & Motta, M. Quantum-centric computation of molecular excited states with extended sample-based quantum diagonalization.Quantum Sci. Technol. 10, 025034 (2025)
work page 2025
- [29]
- [30]
-
[31]
Danilov, D., Robledo-Moreno, J., Sung, K. J., Motta, M. & Shee, J. Enhancing the accuracy and efficiency of sample-based quantum diagonalization with phaseless auxiliary-field quantum Monte Carlo.J. Chem. Theory Comput.21, 11585–11594 (2025)
work page 2025
-
[32]
Shajan, A.et al.Toward quantum-centric simulations of extended molecules: Sample-based quantum diagonalization enhanced with density matrix embedding theory.J. Chem. Theory Comput.21, 6801–6810 (2025)
work page 2025
- [33]
- [34]
-
[35]
Kaliakin, D.et al.Accurate quantum-centric simulations of intermolecular interactions. Commun. Phys.8, 396 (2025)
work page 2025
-
[36]
Hossain, S. M., Lee, S.-C. & Bhattacharjee, S. Quantum simulations of battery electrolytes using variational quantum eigensolver, equation-of-motion, and sample-based diagonalization methods: Active-space design, dissociation, and excited states ofLiP F 6,N aP F 6, and FSI salts.Adv. Quantum Technol.9, e00871 (2026)
work page 2026
- [37]
-
[38]
Jordan, P. & Wigner, E. ¨Uber das paulische ¨ aquivalenzverbot.Z. Phys.47, 631–651 (1928)
work page 1928
-
[39]
Extensions to the k-means algorithm for clustering large data sets with categorical values.Data Min
Huang, Z. Extensions to the k-means algorithm for clustering large data sets with categorical values.Data Min. Knowl. Discov.2, 283–304 (1998)
work page 1998
-
[40]
de Vos, N. J. kmodes categorical clustering library. https://github.com/nicodv/kmodes (2022)
work page 2022
-
[41]
Nadif, M. & Govaert, G. Clustering for binary data and mixture models—choice of the model. Appl. Stoch. Models Data Anal.13, 269–278 (1997)
work page 1997
-
[42]
Morin, S.et al.StepMix: A Python package for pseudo-likelihood estimation of generalized mixture models with external variables.J. Stat. Softw.113, 1–39 (2025)
work page 2025
-
[43]
Perdew, J. P. Density-functional approximation for the correlation energy of the inhomoge- neous electron gas.Phys. Rev. B33, 8822 (1986). 22
work page 1986
-
[44]
Becke, A. D. Density-functional exchange-energy approximation with correct asymptotic behavior.Phys. Rev. A38, 3098 (1988)
work page 1988
-
[45]
Jorge, F., Canal Neto, A., Camiletti, G. & Machado, S. Contracted Gaussian basis sets for Douglas–Kroll–Hess calculations: Estimating scalar relativistic effects of some atomic and molecular properties.J. Chem. Phys.130, 064108 (2009)
work page 2009
-
[46]
Ideas of relativistic quantum chemistry.Mol
Liu, W. Ideas of relativistic quantum chemistry.Mol. Phys.108, 1679–1706 (2010)
work page 2010
- [47]
-
[48]
jrm874/sqd data repository: 0.0.1
jrm874. jrm874/sqd data repository: 0.0.1. https://doi.org/10.5281/zenodo.15324153 (2025)
-
[49]
White, S. R. Density matrix formulation for quantum renormalization groups.Phys. Rev. Lett.69, 2863 (1992)
work page 1992
-
[50]
Sun, Q.et al.Recent developments in the PySCF program package.J. Chem. Phys.153, 024109 (2020)
work page 2020
-
[51]
Sun, Q.et al.PySCF: the Python-based simulations of chemistry framework.WIREs Comput. Mol. Sci.8, e1340 (2018)
work page 2018
-
[52]
Motta, M., Sung, K. J., Whaley, K. B., Head-Gordon, M. & Shee, J. Bridging physical intuition and hardware efficiency for correlated electronic states: the local unitary cluster Jastrow ansatz for electronic structure.Chem. Sci.14, 11213–11227 (2023)
work page 2023
-
[53]
Matsuzawa, Y. & Kurashige, Y. Jastrow-type decomposition in quantum chemistry for low- depth quantum circuits.J. Chem. Theory Comput.16, 944–952 (2020)
work page 2020
-
[54]
ffsim: Faster simulations of fermionic quantum circuits
The ffsim developers. ffsim: Faster simulations of fermionic quantum circuits. https://github. com/qiskit-community/ffsim (2025)
work page 2025
-
[55]
Viola, L. & Lloyd, S. Dynamical suppression of decoherence in two-state quantum systems. Phys. Rev. A58, 2733 (1998)
work page 1998
-
[56]
Van Den Berg, E., Minev, Z. K. & Temme, K. Model-free readout-error mitigation for quantum expectation values.Phys. Rev. A105, 032620 (2022)
work page 2022
-
[57]
A.et al.Qiskit addon: sample-based quantum diagonalization
Saki, A. A.et al.Qiskit addon: sample-based quantum diagonalization. https://github.com/ Qiskit/qiskit-addon-sqd (2024)
work page 2024
-
[58]
Nation, P. D., Kang, H., Sundaresan, N. & Gambetta, J. M. Scalable mitigation of measurement errors on quantum computers.PRX Quantum2, 040326 (2021)
work page 2021
-
[59]
Javadi-Abhari, A.et al.Quantum computing with Qiskit.arXiv preprint arXiv:2405.08810 (2024). 23
work page internal anchor Pith review Pith/arXiv arXiv 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.