Private training in quantum machine learning
Pith reviewed 2026-06-30 07:34 UTC · model grok-4.3
The pith
Hybrid quantum models retain higher accuracy than classical ones when trained with classical DP-SGD privacy.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
For a wide class of quantum models, deterministic bounds on gradient norms translate into explicit control of the detrimental clipping bias introduced by DP-SGD, so that quantum models retain higher accuracy than classical equivalents in private-training regimes where the formal privacy guarantee is supplied by the classical DP-SGD mechanism.
What carries the argument
Deterministic bounds on gradient norms for variational quantum models, used to quantify and limit the bias from gradient clipping in DP-SGD.
If this is right
- Quantum noise does not provide a satisfactory replacement for the calibrated noise required by DP-SGD.
- Gradient-norm bounds reduce the optimization damage caused by clipping, preserving convergence behavior under privacy constraints.
- Under fixed clipping and privacy budget the quantum pipelines show measurably higher final accuracy on both synthetic and image tasks.
Where Pith is reading between the lines
- The advantage may extend to other hybrid quantum-classical pipelines where gradient statistics are easier to bound than in fully classical deep networks.
- If the norm bounds scale favorably with qubit number, larger quantum models could widen the accuracy gap under the same privacy budget.
- The result motivates checking whether quantum-specific privacy mechanisms could further improve the observed retention.
Load-bearing premise
The gradient norms of the quantum models remain bounded in a way that directly limits clipping bias under the chosen threshold.
What would settle it
An experiment on the same synthetic and image-classification tasks that applies identical clipping threshold and privacy budget and finds that the quantum models achieve equal or lower accuracy than the classical models.
read the original abstract
With the emergence of machine learning (ML) models trained on large datasets containing potentially sensitive data, a major question in AI safety is how to make learning private with respect to the training data. Similar to classical machine learning, quantum machine learning (QML) models are not devoid of privacy vulnerabilities. Differential privacy (DP) is a standard tool for training ML models on sensitive data, but its impact in QML remains poorly understood. In this work we study private training in hybrid variational QML models using a classical private DP-SGD optimizer applied to pipelines with classical inputs and outputs. We analyze the interplay between gradient clipping and calibrated noise addition in DP-SGD, and its impact on optimization and accuracy for noisy and noiseless quantum models. We first explain why quantum noise does not provide a satisfactory replacement for the calibrated noise in DP-SGD for ensuring privacy. We then show how the deterministic bounds on gradient norms for a wide class of quantum models translate into explicit control of the detrimental clipping bias introduced by DP-SGD. Finally, we formulate a numerical comparison protocol under fixed clipping threshold and privacy budget and evaluate it on synthetic and image-classification tasks for equivalent quantum and classical models. Our results suggest that quantum models can retain higher accuracy in private-training regimes where the formal privacy guarantee is ensured by a classical DP-SGD mechanism.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript studies private training of hybrid variational QML models (classical inputs/outputs) via a classical DP-SGD optimizer. It first argues that quantum noise cannot replace the calibrated noise required for formal DP guarantees, then claims that deterministic bounds on gradient norms for a wide class of quantum models yield explicit control over the clipping bias induced by DP-SGD, and finally reports a numerical comparison (fixed clipping threshold and privacy budget) on synthetic and image-classification tasks in which quantum models retain higher accuracy than equivalent classical models.
Significance. If the gradient-norm argument is made rigorous and the numerical protocol is fully specified with statistical controls, the work would provide concrete evidence that certain quantum models can mitigate the accuracy penalty of DP-SGD clipping, offering a potential practical advantage for privacy-preserving QML. The explicit fixed-budget comparison protocol is a positive methodological contribution that could be adopted more broadly.
major comments (2)
- [gradient bounds paragraph] The section deriving the gradient-norm bounds (the paragraph beginning 'we then show how the deterministic bounds...'): the claim that these bounds 'translate into explicit control of the detrimental clipping bias' is load-bearing for the accuracy-retention advantage, yet the manuscript supplies neither a uniformity statement over stochastic mini-batches nor a data-dependent verification that per-batch norms remain below the clipping threshold; without this step the reduced clipping damage does not necessarily follow from the worst-case bound.
- [§4] §4 (numerical comparison protocol): the central claim that 'quantum models can retain higher accuracy' rests on an unreported protocol; the manuscript must supply exact dataset sizes, number of independent runs, error bars or confidence intervals, and the precise hyper-parameter values used for both quantum and classical models so that the reported advantage can be assessed for statistical significance.
minor comments (2)
- [Introduction] The distinction between 'noisy and noiseless quantum models' is introduced in the abstract but receives only a brief mention in the main text; a short clarifying sentence in the introduction would improve readability.
- [Tables] Table captions should explicitly state the clipping threshold and privacy budget (ε,δ) used for each row so that the fixed-budget protocol is immediately visible without cross-referencing the text.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive comments on our manuscript. We agree that the gradient-norm section requires an explicit uniformity statement and that the numerical protocol in §4 must be fully specified for reproducibility and statistical assessment. We will revise the manuscript to address both points.
read point-by-point responses
-
Referee: [gradient bounds paragraph] The section deriving the gradient-norm bounds (the paragraph beginning 'we then show how the deterministic bounds...'): the claim that these bounds 'translate into explicit control of the detrimental clipping bias' is load-bearing for the accuracy-retention advantage, yet the manuscript supplies neither a uniformity statement over stochastic mini-batches nor a data-dependent verification that per-batch norms remain below the clipping threshold; without this step the reduced clipping damage does not necessarily follow from the worst-case bound.
Authors: The gradient-norm bounds derived in the manuscript are deterministic and hold uniformly over the entire input domain for the considered class of quantum models. Because the mini-batch gradient is a convex combination of per-sample gradients, the same uniform bound applies directly to every mini-batch; we will add an explicit statement to this effect in the revised manuscript. We also agree that including empirical verification of per-batch norms (relative to the chosen clipping threshold) in the numerical experiments would strengthen the link to clipping bias, and we will add this verification. revision: yes
-
Referee: [§4] §4 (numerical comparison protocol): the central claim that 'quantum models can retain higher accuracy' rests on an unreported protocol; the manuscript must supply exact dataset sizes, number of independent runs, error bars or confidence intervals, and the precise hyper-parameter values used for both quantum and classical models so that the reported advantage can be assessed for statistical significance.
Authors: We agree that the experimental protocol was insufficiently detailed. In the revised manuscript we will report the exact dataset sizes, the number of independent runs performed, error bars together with confidence intervals, and the complete hyper-parameter settings (including optimizer, learning rate, clipping threshold, noise multiplier, and model architectures) for both the quantum and classical models. revision: yes
Circularity Check
No circularity: derivation chain is self-contained with independent numerical and analytical content
full rationale
The paper's load-bearing steps are (1) an explanation why quantum noise cannot replace DP-SGD noise, (2) a claimed translation of deterministic gradient-norm bounds into clipping-bias control, and (3) a fixed-budget numerical comparison protocol evaluated on synthetic and image tasks. None of these reduce by construction to a fitted parameter, a self-definition, or a self-citation chain; the accuracy-retention claim is presented as an empirical outcome of the protocol rather than a renaming or tautology. The abstract and described structure contain no equations that equate a prediction to its own input, and the cited bounds are treated as external inputs to the DP-SGD analysis rather than derived from the present results. This is the normal case of an independent derivation.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Dwork, C., Roth, A.: The algorithmic foundations of differential privacy. Found. Trends Theor. Comput. Sci.9(3–4), 211–407 (2014) https://doi.org/10.1561/ 0400000042
2014
-
[2]
In: Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, pp
Abadi, M., Chu, A., Goodfellow, I., McMahan, H.B., Mironov, I., Talwar, K., Zhang, L.: Deep learning with differential privacy. In: Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, pp. 308– 318 (2016). https://doi.org/10.1145/2976749.2978318
-
[3]
Mironov, I.: R´ enyi differential privacy. 2017 IEEE 30th Computer Security Foun- dations Symposium (CSF), 263–275 (2017) https://doi.org/10.48550/arXiv.1702. 07476 arXiv:1702.07476
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1702 2017
-
[4]
Journal of Privacy and Confidentiality 10(2) (2021) https://doi.org/10.29012/jpc.723
Wang, Y.-X., Balle, B., Kasiviswanathan, S.: Subsampled r´ enyi differential pri- vacy and analytical moments accountant. Journal of Privacy and Confidentiality 10(2) (2021) https://doi.org/10.29012/jpc.723
-
[5]
In: Proceedings of the 34th International Conference on Neural Information Processing Systems
Chen, X., Wu, Z.S., Hong, M.: Understanding gradient clipping in private sgd: a geometric perspective. In: Proceedings of the 34th International Conference on Neural Information Processing Systems. NIPS ’20. Curran Associates Inc., Red Hook, NY, USA (2020)
2020
-
[6]
In: Krause, A., Brunskill, E., Cho, K., Engelhardt, B., Sabato, S., Scarlett, J
Koloskova, A., Hendrikx, H., Stich, S.U.: Revisiting gradient clipping: Stochas- tic bias and tight convergence guarantees. In: Krause, A., Brunskill, E., Cho, K., Engelhardt, B., Sabato, S., Scarlett, J. (eds.) Proceedings of the 40th Inter- national Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 202, pp. 17343–17363. PMLR...
2023
-
[7]
https://arxiv.org/abs/1908.07643
Pichapati, V., Suresh, A.T., Yu, F.X., Reddi, S.J., Kumar, S.: AdaCliP: Adaptive Clipping for Private SGD (2019). https://arxiv.org/abs/1908.07643
- [8]
- [9]
-
[10]
Scientific Reports13, 2023 (2023) https://doi.org/10.1038/ s41598-022-24082-z arXiv:2103.06232
Watkins, W.M., Chen, S.Y.-C., Yoo, S.: Quantum machine learning with dif- ferential privacy. Scientific Reports13, 2023 (2023) https://doi.org/10.1038/ s41598-022-24082-z arXiv:2103.06232
-
[11]
Rofougaran, R., Yoo, S., Tseng, H.-H., Chen, S.Y.-C.: Federated quantum 25 machine learning with differential privacy. In: ICASSP 2024 - 2024 IEEE Inter- national Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 9811–9815 (2024). https://doi.org/10.1109/ICASSP48485.2024.10447155
- [12]
-
[13]
Papernot, N., Thakurta, A., Song, S., Chien, S., Erlingsson, ´U.: Tempered sigmoid activations for deep learning with differential privacy. Proceedings of the AAAI Conference on Artificial Intelligence35(10), 9312–9321 (2021) https://doi.org/ 10.1609/aaai.v35i10.17123
-
[14]
In: Balcan, M.F., Weinberger, K.Q
Arjovsky, M., Shah, A., Bengio, Y.: Unitary evolution recurrent neural net- works. In: Balcan, M.F., Weinberger, K.Q. (eds.) Proceedings of The 33rd International Conference on Machine Learning. Proceedings of Machine Learn- ing Research, vol. 48, pp. 1120–1128. PMLR, New York, New York, USA (2016). https://proceedings.mlr.press/v48/arjovsky16.html
2016
-
[15]
In: Precup, D., Teh, Y.W
Mhammedi, Z., Hellicar, A., Rahman, A., Bailey, J.: Efficient orthogo- nal parametrisation of recurrent neural networks using householder reflec- tions. In: Precup, D., Teh, Y.W. (eds.) Proceedings of the 34th Inter- national Conference on Machine Learning. Proceedings of Machine Learn- ing Research, vol. 70, pp. 2401–2409. PMLR, Sydney, Australia (2017)....
2017
-
[16]
In: The Twelfth International Conference on Learning Representations (2024).https://openreview.net/forum?id=BEyEziZ4R6
B´ ethune, L., Massena, T., Boissin, T., Bellet, A., Mamalet, F., Prudent, Y., Friedrich, C., Serrurier, M., Vigouroux, D.: DP-SGD without clipping: The lips- chitz neural network way. In: The Twelfth International Conference on Learning Representations (2024).https://openreview.net/forum?id=BEyEziZ4R6
2024
-
[17]
Hirche, C., Rouz´ e, C., Fran¸ ca, D.S.: Quantum differential privacy: An information theory perspective. IEEE Transactions on Information Theory69(9), 5771–5787 (2023) https://doi.org/10.1109/TIT.2023.3272904
-
[18]
https://arxiv.org/abs/2307.04733
Angrisani, A., Doosti, M., Kashefi, E.: A unifying framework for differentially private quantum algorithms (2023). https://arxiv.org/abs/2307.04733
-
[19]
Du, Y., Hsieh, M.-H., Liu, T., Tao, D., Liu, N.: Quantum noise protects quantum classifiers against adversaries. Phys. Rev. Res.3, 023153 (2021) https://doi.org/ 10.1103/PhysRevResearch.3.023153
-
[20]
In: 2024 IEEE International Conference on Quantum Computing and Engineering (QCE), vol
Zhao, Y., Zhong, H., Zhang, X., Li, Y., Zhang, C., Pan, M.: Bridging quantum computing and differential privacy: Insights into quantum computing privacy. In: 2024 IEEE International Conference on Quantum Computing and Engineering (QCE), vol. 01, pp. 13–24 (2024). https://doi.org/10.1109/QCE60285.2024.00012
-
[21]
npj Quantum Information11(2025) https://doi.org/10.1038/s41534-025-01022-z
Heredge, J., Kumar, N., Herman, D., Chakrabarti, S., Yalovetzky, R., Sureshbabu, 26 S.H., Li, C., Pistoia, M.: Characterizing privacy in quantum machine learning. npj Quantum Information11(2025) https://doi.org/10.1038/s41534-025-01022-z
-
[22]
Su, J., He, R., Li, G., Qin, S., He, Z., Situ, H., Gao, F.: From membership-privacy leakage to quantum machine unlearning (2025) arXiv:2509.06086
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[23]
EPJ Quantum Technology11(1), 72 (2024) https://doi.org/10.1140/epjqt/ s40507-024-00285-3
Rath, M., Date, H.: Quantum data encoding: a comparative analysis of classical- to-quantum mapping techniques and their impact on machine learning accu- racy. EPJ Quantum Technology11(1), 72 (2024) https://doi.org/10.1140/epjqt/ s40507-024-00285-3
-
[24]
Quantum4, 226 (2020) https: //doi.org/10.22331/q-2020-02-06-226
P´ erez-Salinas, A., Cervera-Lierta, A., Gil-Fuster, E., Latorre, J.I.: Data re- uploading for a universal quantum classifier. Quantum4, 226 (2020) https: //doi.org/10.22331/q-2020-02-06-226
-
[25]
Nature Reviews Physics7(4), 174–189 (2025) https://doi
Larocca, M., Thanasilp, S., Wang, S., Sharma, K., Biamonte, J., Coles, P.J., Cincio, L., McClean, J.R., Holmes, Z., Cerezo, M.: Barren plateaus in variational quantum computing. Nature Reviews Physics7(4), 174–189 (2025) https://doi. org/10.1038/s42254-025-00813-9
-
[26]
URL https://dblp.org/rec/conf/sp/ShokriSSS17
Shokri, R., Stronati, M., Song, C., Shmatikov, V.: Membership inference attacks against machine learning models. In: IEEE Symposium on Security and Privacy, pp. 3–18 (2017). https://doi.org/10.1109/SP.2017.41
-
[27]
Privacy Risk in Machine Learning: Analyzing the Connection to Overfitting
Yeom, S., Giacomelli, I., Fredrikson, M., Jha, S.: Privacy risk in machine learning: Analyzing the connection to overfitting. 2018 IEEE Computer Security Foun- dations Symposium (CSF), 268–282 (2018) https://doi.org/10.1109/CSF.2018. 00027 arXiv:1709.01604
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1109/csf.2018 2018
-
[28]
Carlini, N., Chien, S., Nasr, M., Song, S., Terzis, A., Tram` er, F.: Membership inference attacks from first principles. In: 2022 IEEE Symposium on Security and Privacy (SP), pp. 1897–1914 (2022). https://doi.org/10.1109/SP46214.2022. 9833649
-
[29]
Kairouz, P., McMahan, H.B., Avent, B., Bellet, A., Bennis, M., Bhagoji, A.N., Bonawitz, K., Charles, Z., Cormode, G., Cummings, R., D’Oliveira, R.G.L., Eich- ner, H., El Rouayheb, S., Evans, D., Gardner, J., Garrett, Z., Gasc´ on, A., Ghazi, B., Gibbons, P.B., Gruteser, M., Harchaoui, Z., He, C., He, L., Huo, Z., Hutchin- son, B., Hsu, J., Jaggi, M., Javi...
-
[30]
IEEE Internet of Things Journal7(7), 5827–5842 (2020) https://doi.org/10.1109/JIOT.2019.2952146
Mahawaga Arachchige, P.C., Bertok, P., Khalil, I., Liu, D., Camtepe, S., Atiquz- zaman, M.: Local differential privacy for deep learning. IEEE Internet of Things Journal7(7), 5827–5842 (2020) https://doi.org/10.1109/JIOT.2019.2952146
-
[31]
In: 2019 IEEE Symposium on Security and Privacy (SP), pp
Lecuyer, M., Atlidakis, V., Geambasu, R., Hsu, D., Jana, S.: Certified robustness to adversarial examples with differential privacy. In: 2019 IEEE Symposium on Security and Privacy (SP), pp. 656–672 (2019). https://doi.org/10.1109/SP.2019. 00044
-
[32]
Quantum6, 677 (2022) https://doi.org/10.22331/ q-2022-03-30-677
Wierichs, D., Izaac, J., Wang, C., Lin, C.Y.-Y.: General parameter-shift rules for quantum gradients. Quantum6, 677 (2022) https://doi.org/10.22331/ q-2022-03-30-677
2022
-
[33]
In: 2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE), pp
Khanal, B., Rivas, P.: Evaluating the impact of noise on variational quantum circuits in nisq era devices. In: 2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE), pp. 1658–1664 (2023). https://doi. org/10.1109/CSCE60160.2023.00272
-
[34]
Nature Physics22(5), 751–756 (2026) https://doi.org/10.1038/ s41567-026-03245-z
Mele, A.A., Angrisani, A., Ghosh, S., Khatri, S., Eisert, J., Stilck Fran¸ ca, D., Quek, Y.: Noise-induced shallow circuits and the absence of bar- ren plateaus. Nature Physics22(5), 751–756 (2026) https://doi.org/10.1038/ s41567-026-03245-z
2026
-
[35]
GitHub (2026)
Sedrakyan, T.: private-qml: Code for ”Private training in quantum machine learning”. GitHub (2026). https://github.com/tigran-sedrakyan/private-qml
2026
-
[36]
Dynamic Traffic Scene Classification with Space-Time Coherence
Narayanan, A., Dwivedi, I., Dariush, B.: Dynamic Traffic Scene Classification with Space-Time Coherence (2019). https://arxiv.org/abs/1905.12708
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[37]
Gharibyan, H., Karapetyan, H., Sedrakyan, T., Subasic, P., Su, V.P., Tanin, R.H., Tepanyan, H.: Quantum image loading and classification: experiments on utility- scale quantum computers. Quantum Machine Intelligence8(1), 57 (2026) https: //doi.org/10.1007/s42484-026-00388-3 28 A Proofs of Theorems Theorem 1(Quantum noise propagation to loss gradients)Cons...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.