StableShots: Online Shot Stopping for Quantum Circuit Execution
Pith reviewed 2026-06-26 11:39 UTC · model grok-4.3
The pith
StableShots stops quantum circuit measurements once cumulative distributions show repeated local stability in total variation distance.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
StableShots executes a fixed circuit in small batches, monitors the total-variation distance between cumulative empirical distributions, and stops after repeated evidence of local stability. With validation-only calibration and 100 repeated backend-holdout splits across 180 QSimBench traces spanning six circuit families, six sizes from 4 to 14 qubits, and five noisy IBM simulated backends, the selected configuration reaches TVD <= 0.05 on all held-out test evaluations with median 7,650 shots, whereas fixed-shot baselines either fail more often or spend substantially more shots.
What carries the argument
The online stopping rule that halts after repeated local stability in total-variation distance between cumulative empirical distributions from small batches of shots.
If this is right
- The adaptive rule meets the TVD <= 0.05 target on every held-out evaluation while using a median of only 7,650 shots.
- Fixed-shot baselines either exceed the TVD target more frequently or require substantially higher shot counts to match the same reliability.
- The calibration uses only validation data and generalizes across 100 backend-holdout splits without retraining per test backend.
- The same stability criterion applies uniformly to six circuit families and five noisy simulated backends spanning 4 to 14 qubits.
Where Pith is reading between the lines
- Circuit compilers or runtime libraries could embed the stopping rule to allocate shots automatically instead of requiring users to guess budgets.
- The batch-stability approach might extend to other sampling-based quantum tasks such as variational algorithms where distribution convergence is also the goal.
- Combining the rule with existing error-mitigation post-processing could further reduce the effective shot cost needed for a target accuracy.
Load-bearing premise
Repeated evidence of local stability in total-variation distance between cumulative empirical distributions reliably signals that the empirical distribution has converged sufficiently to the true output distribution.
What would settle it
Running the method on additional circuits or real hardware and finding cases where it stops yet later independent measurements yield TVD above 0.05 on a substantial fraction of trials.
read the original abstract
Quantum circuit execution estimates output distributions by repeated measurements, yet developers commonly choose a fixed shot budget before execution. This static choice is brittle: low budgets can under-sample the distribution, while high budgets waste measurements. In this paper, we present StableShots, a black-box online stopping rule for static quantum circuits. The method executes a fixed circuit in small batches, monitors the total-variation distance between cumulative empirical distributions, and stops after repeated evidence of local stability. We evaluate StableShots on 180 QSimBench traces spanning six circuit families, six sizes from 4 to 14 qubits, and five noisy IBM simulated backends. With validation-only calibration and 100 repeated backend-holdout splits, the selected configuration reaches TVD <= 0.05 on all held-out test evaluations with median 7,650 shots, whereas fixed-shot baselines either fail more often or spend substantially more shots.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces StableShots, a black-box online stopping rule for static quantum circuits. Circuits are executed in small batches; the method monitors total-variation distance between successive cumulative empirical distributions and halts after repeated evidence of local stability. On 180 QSimBench traces (six circuit families, 4–14 qubits, five noisy IBM simulated backends), validation-only calibration plus 100 repeated backend-holdout splits yields a configuration that attains TVD ≤ 0.05 on every held-out test evaluation at a median of 7,650 shots, outperforming fixed-shot baselines that either exceed the TVD threshold more often or consume substantially more shots.
Significance. If the central empirical claim holds, the work supplies a practical, calibration-light procedure for dynamically allocating shots while controlling distribution error. The repeated backend-holdout design with validation-only calibration is a methodological strength that reduces the risk of test-set overfitting. The breadth of circuit families and backends supplies useful evidence of applicability within the evaluated regime.
major comments (2)
- The headline performance (TVD ≤ 0.05 on all held-out evaluations) rests on the unproven link that repeated small-batch TVD stability between cumulative empirical distributions reliably signals proximity to the backend’s true output distribution. This assumption can fail when low-probability outcomes remain unsampled or when noise produces early plateaus; the 100 splits supply empirical support only inside the tested set and do not constitute a general guarantee.
- [Evaluation] The abstract and evaluation description report the selected configuration but supply neither the exact stability criterion (number of consecutive stable batches, TVD threshold), the batch size, nor any sensitivity analysis of these free parameters. Without these details the reported median of 7,650 shots cannot be reproduced or stress-tested outside the authors’ implementation.
Simulated Author's Rebuttal
We thank the referee for the thoughtful review and for highlighting important points regarding the empirical nature of our claims and reproducibility. We respond to each major comment below.
read point-by-point responses
-
Referee: The headline performance (TVD ≤ 0.05 on all held-out evaluations) rests on the unproven link that repeated small-batch TVD stability between cumulative empirical distributions reliably signals proximity to the backend’s true output distribution. This assumption can fail when low-probability outcomes remain unsampled or when noise produces early plateaus; the 100 splits supply empirical support only inside the tested set and do not constitute a general guarantee.
Authors: We agree that StableShots is a heuristic method without a theoretical convergence guarantee to the true distribution. The manuscript frames the approach as an empirical online stopping rule whose reliability is supported by the repeated backend-holdout evaluation across 180 traces. We will revise the introduction and discussion sections to explicitly note the heuristic character, to acknowledge potential failure modes such as unsampled rare outcomes or noise-induced plateaus, and to clarify that the reported performance is specific to the evaluated regime rather than a general proof. revision: yes
-
Referee: [Evaluation] The abstract and evaluation description report the selected configuration but supply neither the exact stability criterion (number of consecutive stable batches, TVD threshold), the batch size, nor any sensitivity analysis of these free parameters. Without these details the reported median of 7,650 shots cannot be reproduced or stress-tested outside the authors’ implementation.
Authors: The full methods section of the manuscript specifies the batch size, the number of consecutive stable batches required, and the internal TVD threshold used for the stopping decision, along with the validation-only calibration procedure. However, these parameters are not restated in the abstract or the main evaluation narrative. We will revise the evaluation section to include the exact hyperparameter values and add a sensitivity analysis (varying batch size and stability window) to improve reproducibility and allow external stress-testing. revision: yes
Circularity Check
No significant circularity; evaluation uses independent hold-outs
full rationale
The paper defines StableShots as an online stopping rule that monitors repeated local stability in TVD between successive cumulative empirical distributions computed from small batches. Its headline performance claim (TVD <= 0.05 on all held-out evaluations, median 7,650 shots) is obtained after validation-only calibration across 100 repeated backend-holdout splits on 180 QSimBench traces. Because the test metric is measured on data never seen during calibration and the stopping rule itself contains no fitted parameters that are renamed as predictions, no equation or self-citation reduces the reported result to its inputs by construction. The derivation chain is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
free parameters (2)
- batch size
- stability criterion
axioms (1)
- domain assumption Total variation distance between cumulative empirical distributions is an appropriate indicator of local stability for the purpose of deciding when to stop sampling.
Reference graph
Works this paper leans on
-
[1]
M. A. Nielsen and I. L. Chuang,Quantum Computation and Quantum Information. Cambridge University Press, 2010
2010
-
[2]
From quantum software handcrafting to quantum software engineering,
G. Bisicchia et al., “From quantum software handcrafting to quantum software engineering,” inProc. IEEE SANER-C, 2024
2024
-
[3]
Probability inequalities for sums of bounded random variables,
W. Hoeffding, “Probability inequalities for sums of bounded random variables,”Journal of the American Statistical Association, 1963
1963
-
[4]
Inequalities for theL 1 deviation of the empirical distribution,
T. Weissman et al., “Inequalities for theL 1 deviation of the empirical distribution,” Hewlett-Packard Labs, Tech. Rep., 2003
2003
-
[5]
Concentration inequalities for the empirical distribu- tion,
J. Mardia et al., “Concentration inequalities for the empirical distribu- tion,” arXiv:1809.06522, 2018
arXiv 2018
-
[6]
Optimizing shot assignment in variational quantum eigen- solver measurement,
L. Zhu et al., “Optimizing shot assignment in variational quantum eigen- solver measurement,”Journal of Chemical Theory and Computation, 2024
2024
-
[7]
Adaptive shot allocation for fast convergence in variational quantum algorithms,
A. Gu et al., “Adaptive shot allocation for fast convergence in variational quantum algorithms,” arXiv:2108.10434, 2021
arXiv 2021
-
[8]
Artificial-intelligence-driven shot reduction in quantum measurement,
S. Liang et al., “Artificial-intelligence-driven shot reduction in quantum measurement,”Chemical Physics Reviews, 2024
2024
-
[9]
Distributing quantum computations, shot-wise,
G. Bisicchia et al., “Distributing quantum computations, shot-wise,” in Future Internet, 2025
2025
-
[10]
Rethinking Services in the Quantum Age: The SOQ Paradigm,
J. Garcia-Alonso et al., “Rethinking Services in the Quantum Age: The SOQ Paradigm,” inACM Transactions on Software Engineering and Methodology, 2026
2026
-
[11]
QSimBench: An execution-level benchmark suite for quantum software engineering,
G. Bisicchia et al., “QSimBench: An execution-level benchmark suite for quantum software engineering,” inProc. IEEE QCE, 2025
2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.