pith. sign in

arxiv: 2605.29557 · v1 · pith:EISJCVDYnew · submitted 2026-05-28 · 🪐 quant-ph

Quantum Subliminal Learning

Pith reviewed 2026-06-29 07:10 UTC · model grok-4.3

classification 🪐 quant-ph
keywords subliminal learningquantum neural networksmodel distillationhidden information transfergeometric driftquantum securitytask channel
0
0 comments X

The pith

Quantum neural networks retain most hidden-task signals through public-task interfaces while classical networks transmit little.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper examines whether machine learning models can pick up hidden behavioral traits from training data or interfaces that appear only to address a public task. It compares classical neural networks with quantum neural networks across two distillation setups: one using random inputs and another forcing the student to match only public supervised outputs while the hidden behavior lives on a separate task. Both architectures transmit hidden information efficiently when the channel uses random inputs, but the public-task interface reveals a sharp difference: quantum models keep most of the hidden signal whereas classical models largely block it. A single geometric description accounts for the difference by tracking how much of the teacher’s drift that relates to the hidden task stays visible after the public interface is applied. The result flags a concrete risk in quantum model supply chains and points to a possible method for controlled transfer of hidden quantum information.

Core claim

Both classical and quantum neural networks exhibit efficient subliminal learning through an auxiliary channel on random inputs, yet the restricted task channel displays strong architecture dependence: classical networks transmit little hidden-task information through the public-task interface whereas quantum neural networks retain most of the hidden-task signal. Transmission in both regimes is governed by a unified geometric picture in which the amount of hidden information that reaches the student is set by the magnitude of the teacher drift together with the fraction of that drift which remains visible once the public interface is imposed.

What carries the argument

The geometric drift picture, in which hidden-task transmission is set by teacher drift magnitude and the visible fraction of hidden-task-relevant drift through the public interface.

If this is right

  • Quantum models can inherit hidden behavioral traits even when the student is trained exclusively on public supervised outputs.
  • Classical models largely prevent such inheritance through the same public-task interface.
  • The amount of transmitted hidden information scales with teacher drift magnitude and the visible fraction of hidden-relevant drift.
  • The architecture dependence creates a concrete security concern for quantum model supply chains.
  • The same mechanism offers a potential controlled route for hidden-information transfer in quantum information processing.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the drift picture generalizes, hardware implementations of quantum models may need explicit visibility filters on public interfaces to limit unintended information leakage.
  • The contrast between auxiliary and task channels suggests that random-input distillation may be a reliable way to move hidden quantum states even when task-specific training is restricted.
  • Testing whether the same architecture split appears in variational quantum algorithms versus tensor-network classical models would clarify whether the effect is tied to quantum superposition or to the specific circuit structure used.

Load-bearing premise

The two distillation pathways studied are representative of realistic model-distillation scenarios and the geometric drift description holds without extra factors such as noise or architecture-specific regularization.

What would settle it

A measurement showing that quantum neural networks trained only on public-task labels lose nearly all hidden-task performance on the disjoint task, comparable to the loss seen in classical networks under identical conditions.

Figures

Figures reproduced from arXiv: 2605.29557 by Shi-Xin Zhang, Yu-Qin Chen.

Figure 1
Figure 1. Figure 1: FIG. 1. Overview of the two subliminal learning settings and representative results. Panel (a) sketches the two distillation [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: FIG. 2. Auxiliary-channel subliminal learning trends across classical and quantum models. Panels (a) and (b) show teacher [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: FIG. 3. Task-channel visibility and transmission. Panel (a) compares two CNN controls, a narrow classical MLP, and the [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗
read the original abstract

Machine learning models can inherit hidden behavioral traits through innocuous public interfaces, a phenomenon known as subliminal learning. Here we extend this framework to quantum models and study two distillation pathways: an auxiliary channel on random inputs and a restricted task channel in which the student matches a public supervised output while the hidden behavior resides on a disjoint task. Both classical and quantum neural networks (QNNs) exhibit efficient auxiliary-channel subliminal learning, but the task channel shows strong architecture dependence. Classical neural networks transmit little hidden-task information through the public-task interface, whereas QNNs retain most of the hidden-task signal. We show that a unified geometric picture explains both regimes: transmission is controlled by the teacher drift magnitude together with the fraction of hidden-task-relevant drift that remains visible through the public interface. These results identify a concrete security concern for quantum model supply chains and suggest a controlled route for hidden-information transfer in quantum information processing.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript extends subliminal learning to quantum neural networks via two distillation pathways (auxiliary channel on random inputs; restricted task channel with disjoint hidden behavior). Both classical NNs and QNNs exhibit efficient auxiliary-channel transmission, but the restricted task channel reveals strong architecture dependence: classical networks transmit little hidden-task information while QNNs retain most of the signal. A unified geometric construction—teacher drift magnitude together with the fraction of hidden-task-relevant drift visible through the public interface—is offered to account for both regimes and to identify a security concern for quantum model supply chains.

Significance. If the geometric picture can be shown to be predictive (i.e., the visible-drift fraction computed independently of the measured transmission rates), the result would constitute a concrete, architecture-specific security finding for quantum ML supply chains and a controlled mechanism for hidden-information transfer. The explicit contrast between classical and quantum behavior under the same public interface is the load-bearing claim.

major comments (2)
  1. [geometric picture / results] The geometric picture (abstract and results section) asserts that transmission is controlled by teacher drift magnitude plus the visible fraction of hidden-task-relevant drift. If this fraction is extracted from the same output statistics used to quantify transmission rates rather than from an a-priori property of the public-task loss surface or model class, the construction is descriptive rather than predictive and cannot establish that the architecture dependence is explained rather than fitted post hoc.
  2. [methods / distillation pathways] The two distillation pathways are presented as representative, yet no quantitative justification is given that the restricted task channel (public supervised output, hidden behavior on disjoint task) captures realistic model-distillation scenarios without confounding factors such as regularization, measurement noise, or architecture-specific inductive biases.
minor comments (2)
  1. [abstract / introduction] Notation for the two drift parameters should be introduced with explicit definitions and units before the geometric picture is invoked.
  2. [figures] Figure captions should state the precise numerical values of teacher drift magnitude and visible fraction used in each panel.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive report. We address the two major comments below, clarifying the independence of the geometric construction and providing additional context on the distillation pathways. Revisions will be made to strengthen the predictive aspects and add discussion of the experimental design.

read point-by-point responses
  1. Referee: [geometric picture / results] The geometric picture (abstract and results section) asserts that transmission is controlled by teacher drift magnitude plus the visible fraction of hidden-task-relevant drift. If this fraction is extracted from the same output statistics used to quantify transmission rates rather than from an a-priori property of the public-task loss surface or model class, the construction is descriptive rather than predictive and cannot establish that the architecture dependence is explained rather than fitted post hoc.

    Authors: We thank the referee for this important clarification on predictive versus descriptive modeling. The visible fraction in our geometric construction is computed from the a-priori geometry of the public-task loss surface and the model class: specifically, the projection of the hidden-task drift vector onto the subspace spanned by the public-task gradients, which depends only on the architecture and the public interface definition. This quantity is obtained before any hidden-task transmission measurements are performed. The teacher drift magnitude is likewise measured on the public task. Transmission rates on the hidden task are then compared against the prediction from these two quantities. We will revise the results section to include an explicit, independent calculation of the visible fraction for each architecture and demonstrate its use in predicting the observed transmission rates. revision: yes

  2. Referee: [methods / distillation pathways] The two distillation pathways are presented as representative, yet no quantitative justification is given that the restricted task channel (public supervised output, hidden behavior on disjoint task) captures realistic model-distillation scenarios without confounding factors such as regularization, measurement noise, or architecture-specific inductive biases.

    Authors: The restricted task channel is introduced to isolate the effect of information transfer through a public supervised interface while keeping the hidden behavior on a completely disjoint task; this design choice deliberately removes direct task overlap that could confound the measurement. Standard training protocols are used without additional regularization beyond convergence requirements, and the quantum simulations incorporate realistic measurement models. We acknowledge that real-world distillation pipelines may include further factors, but the pathway is chosen to reveal the architecture dependence under controlled conditions. We will add a dedicated paragraph in the methods section discussing these design choices, the mitigation of the listed confounding factors, and the scope of the claim regarding realistic scenarios. revision: partial

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The abstract and provided context present the geometric picture as a post-experiment explanation for observed architecture-dependent transmission differences in the two distillation pathways. No equations, self-citations, or derivations are quoted that reduce the visible-drift fraction or teacher-drift magnitude to the transmission statistics by construction. The central claim retains independent experimental content distinguishing QNN retention from classical near-zero transmission, satisfying the requirement for self-contained derivation against external benchmarks.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The geometric picture and the two distillation pathways rest on domain assumptions about how drift propagates through interfaces; no free parameters are explicitly named in the abstract but the drift magnitude and visible fraction function as controlling quantities that may be fitted or chosen.

free parameters (2)
  • teacher drift magnitude
    Controls the overall scale of information transmission in the geometric model.
  • fraction of hidden-task-relevant drift visible through public interface
    Determines how much hidden information leaks in the task-channel regime.
axioms (1)
  • domain assumption The geometric drift picture unifies auxiliary-channel and task-channel regimes for both classical and quantum networks
    Invoked to explain the architecture dependence observed in the task channel.

pith-pipeline@v0.9.1-grok · 5674 in / 1412 out tokens · 38527 ms · 2026-06-29T07:10:58.283651+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

54 extracted references · 8 canonical work pages · 3 internal anchors

  1. [1]

    Distilling the Knowledge in a Neural Network

    G. Hinton, O. Vinyals, and J. Dean, Distilling the knowledge in a neural network, arXiv preprint arXiv:1503.02531 (2015)

  2. [2]

    Romero, N

    A. Romero, N. Ballas, S. Ebrahimi Kahou, A. Chassang, C. Gatta, and Y. Bengio, Fitnets: Hints for thin deep nets, inInternational Conference on Learning Represen- tations(2015)

  3. [3]

    J. Gou, B. Yu, S. J. Maybank, and D. Tao, Knowledge distillation: A survey, International Journal of Computer Vision129, 1789 (2021)

  4. [4]

    A.Mari, T.R.Bromley, J.Izaac, M.Schuld,andN.Killo- ran, Transfer learning in hybrid classical-quantum neural networks, Quantum4, 340 (2020)

  5. [5]

    L. Li, Y. Wang, J. Fan, J. Li, S. Qin, Q. Wen, and F. Gao, Quantum knowledge distillation for large language mod- els, arXiv preprint arXiv:2505.13205 (2025)

  6. [6]

    Cloud, M

    A. Cloud, M. Le, J. Chua, J. Betley, A. Sztyber-Betley, S. Mindermann, J. Hilton, S. Marks, and O. Evans, Lan- guage models transmit behavioural traits through hidden signals in data, Nature652, 615 (2026)

  7. [7]

    Biggio, B

    B. Biggio, B. Nelson, and P. Laskov, Poisoning at- tacks against support vector machines, inProceedings of the 29th International Conference on Machine Learning (2012) pp. 1807–1814

  8. [8]

    T. Gu, K. Liu, B. Dolan-Gavitt, and S. Garg, Badnets: Evaluating backdooring attacks on deep neural networks, IEEE Access7, 47230 (2019)

  9. [9]

    X. Chen, C. Liu, B. Li, K. Lu, and D. Song, Targeted backdoor attacks on deep learning systems using data poisoning, arXiv preprint arXiv:1712.05526 (2017)

  10. [10]

    A. Saha, A. Subramanya, and H. Pirsiavash, Hidden trig- ger backdoor attacks, inProceedings of the AAAI Confer- ence on Artificial Intelligence, Vol. 34 (2020) pp. 11957– 11965

  11. [11]

    Y. Li, Y. Jiang, Z. Li, and S.-T. Xia, Backdoor learning: A survey, IEEE Transactions on Neural Networks and Learning Systems35, 5 (2024)

  12. [12]

    Shafahi, W

    A. Shafahi, W. R. Huang, M. Najibi, O. Suciu, C. Studer, T. Dumitras, and T. Goldstein, Poison frogs! targeted clean-label poisoning attacks on neural networks, inAd- vances in neural information processing systems, Vol. 31 (2018)

  13. [13]

    Goldblum, D

    M. Goldblum, D. Tsipras, C. Xie, X. Chen, A. Schwarzschild, D. Song, A. Madry, B. Li, and T. Goldstein, Dataset security for machine learn- ing: Data poisoning, backdoor attacks, and defenses, IEEE Transactions on Pattern Analysis and Machine Intelligence45, 1563 (2023)

  14. [14]

    Tramèr, F

    F. Tramèr, F. Zhang, A. Juels, M. K. Reiter, and T. Ris- tenpart, Stealing machine learning models via prediction APIs, in25th USENIX Security Symposium(2016) pp. 601–618

  15. [15]

    Papernot, P

    N. Papernot, P. McDaniel, I. Goodfellow, S. Jha, Z. B. Celik, and A. Swami, Practical black-box attacks against machine learning, inProceedings of the 2017 ACM on Asia Conference on Computer and Communications Se- curity(2017) pp. 506–519

  16. [16]

    Lu, L.-M

    S. Lu, L.-M. Duan, and D.-L. Deng, Quantum adversarial machine learning, Physical Review Research2, 033212 6 (2020)

  17. [17]

    Liu and P

    N. Liu and P. Wittek, Vulnerability of quantum classi- fication to adversarial perturbations, Physical Review A 101, 062331 (2020)

  18. [18]

    W. Ren, W. Li, S. Xu, K. Wang, W. Jiang, F. Jin, X. Zhu, J. Chen, Z. Song, P. Zhang, H. Dong, X. Zhang, J. Deng, Y. Gao, C. Zhang, Y. Wu, B. Zhang, Q. Guo, H. Li, Z. Wang, J. Biamonte, C. Song, D.-L. Deng, and H. Wang, Experimental quantum adversarial learn- ing with programmable superconducting qubits, Nature Computational Science2, 711 (2022)

  19. [19]

    Chen and S.-X

    Y.-Q. Chen and S.-X. Zhang, Superior resilience to poi- soning and amenability to unlearning in quantum ma- chine learning, Nature Communications17, 3716 (2026)

  20. [20]

    J.Betley, N.Warncke, A.Sztyber-Betley, D.Tan, X.Bao, M. Soto, M. Srivastava, N. Labenz, and O. Evans, Train- ing large language models on narrow tasks can lead to broad misalignment, Nature649, 584 (2026)

  21. [21]

    G. J. Simmons, The prisoners’ problem and the sublimi- nal channel, inAdvances in Cryptology(Springer, 1984) pp. 51–67

  22. [22]

    R. J. Anderson and F. A. Petitcolas, On the limits of steganography, IEEE Journal on selected areas in com- munications16, 474 (1998)

  23. [23]

    Peruzzo, J

    A. Peruzzo, J. McClean, P. Shadbolt, M.-H. Yung, X.-Q. Zhou, P. J. Love, A. Aspuru-Guzik, and J. L. O’Brien, A variational eigenvalue solver on a photonic quantum processor, Nature Communications5, 4213 (2014)

  24. [24]

    Biamonte, P

    J. Biamonte, P. Wittek, N. Pancotti, P. Rebentrost, N. Wiebe, and S. Lloyd, Quantum machine learning, Na- ture549, 195 (2017)

  25. [25]

    Cerezo, A

    M. Cerezo, A. Arrasmith, R. Babbush, S. C. Benjamin, S. Endo, K. Fujii, J. R. McClean, K. Mitarai, X. Yuan, L. Cincio, and P. J. Coles, Variational quantum algo- rithms, Nature Reviews Physics3, 625 (2021)

  26. [26]

    Bharti, A

    K. Bharti, A. Cervera-Lierta, T. H. Kyaw, T. Haug, S. Alperin-Lea, A. Anand, M. Degroote, H. Heimonen, J. S. Kottmann, T. Menke, W.-K. Mok, S. Sim, L.-C. Kwek, and A. Aspuru-Guzik, Noisy intermediate-scale quantum algorithms, Reviews of Modern Physics94, 015004 (2022)

  27. [27]

    Mitarai, M

    K. Mitarai, M. Negoro, M. Kitagawa, and K. Fujii, Quantum circuit learning, Physical Review A98, 032309 (2018)

  28. [28]

    Benedetti, E

    M. Benedetti, E. Lloyd, S. Sack, and M. Fiorentini, Pa- rameterized quantum circuits as machine learning mod- els, Quantum Science and Technology4, 043001 (2019)

  29. [29]

    K. Beer, D. Bondarenko, T. Farrelly, T. J. Osborne, R. Salzmann, D. Scheiermann, and R. Wolf, Training deep quantum neural networks, Nature Communications 11, 808 (2020)

  30. [30]

    Chen and S.-X

    Y.-Q. Chen and S.-X. Zhang, Intrinsic preservation of plasticity in continual quantum learning, arXiv preprint arXiv:2511.17228 (2025)

  31. [31]

    Schuld and N

    M. Schuld and N. Killoran, Quantum machine learning in feature hilbert spaces, Physical review letters122, 040504 (2019)

  32. [32]

    Havlíček, A

    V. Havlíček, A. D. Córcoles, K. Temme, A. W. Harrow, A. Kandala, J. M. Chow, and J. M. Gambetta, Super- vised learning with quantum-enhanced feature spaces, Nature567, 209 (2019)

  33. [33]

    Pérez-Salinas, A

    A. Pérez-Salinas, A. Cervera-Lierta, E. Gil-Fuster, and J. I. Latorre, Data re-uploading for a universal quantum classifier, Quantum4, 226 (2020)

  34. [34]

    Schuld, R

    M. Schuld, R. Sweke, and J. J. Meyer, Effect of data encoding on the expressive power of variational quantum- machine-learningmodels,PhysicalReviewA103,032430 (2021)

  35. [35]

    Schatzki, A

    L. Schatzki, A. Arrasmith, P. J. Coles, and M. Cerezo, Entangled datasets for quantum machine learning, arXiv preprint arXiv:2109.03400 (2021)

  36. [36]

    Abbas, D

    A. Abbas, D. Sutter, C. Zoufal, A. Lucchi, A. Figalli, and S. Woerner, The power of quantum neural networks, Nature Computational Science1, 403 (2021)

  37. [37]

    M. C. Caro, H.-Y. Huang, M. Cerezo, K. Sharma, A. Sornborger, L. Cincio, and P. J. Coles, Generalization in quantum machine learning from few training data, Na- ture communications13, 4919 (2022)

  38. [38]

    S. Sim, P. D. Johnson, and A. Aspuru-Guzik, Express- ibility and entangling capability of parameterized quan- tumcircuitsforhybridquantum-classicalalgorithms,Ad- vanced Quantum Technologies2, 1900070 (2019)

  39. [39]

    Huang, M

    H.-Y. Huang, M. Broughton, M. Mohseni, R. Babbush, S. Boixo, H. Neven, and J. R. McClean, Power of data in quantum machine learning, Nature communications12, 2631 (2021)

  40. [40]

    T. Haug, K. Bharti, and M. Kim, Capacity and quan- tum geometry of parametrized quantum circuits, PRX Quantum2, 040309 (2021)

  41. [41]

    J. R. McClean, S. Boixo, V. N. Smelyanskiy, R. Bab- bush, and H. Neven, Barren plateaus in quantum neural network training landscapes, Nature communications9, 4812 (2018)

  42. [42]

    Zhang, J

    S.-X. Zhang, J. Allcock, Z.-Q. Wan, S. Liu, J. Sun, H. Yu, X.-H. Yang, J. Qiu, Z. Ye, Y.-Q. Chen, C.-K. Lee, Y.-C. Zheng, S.-K. Jian, H. Yao, C.-Y. Hsieh, and S. Zhang, Tensorcircuit: a quantum software framework for the NISQ era, Quantum7, 912 (2023)

  43. [43]

    Zhang, Y.-Q

    S.-X. Zhang, Y.-Q. Chen, W. Li, J. Sun, W.-G. Ma, P.-L. Zheng, Y.-X. Huang, Q.-X. Wang, H. Yu, Z. Li, X. Huang, Z.-L. Li, Z.-Q. Wan, S. Liu, J. Qiu, J. Miao, Z. Song, Y. Yan, K. Tsuoka, P. Zhang, L. Wang, H. Fan, C.-Y. Hsieh, H. Yao, and T. Xiang, Tensorcircuit-ng: A universal, composable, and scalable platform for quan- tum computing and quantum simulati...

  44. [44]

    Hayden and J

    P. Hayden and J. Preskill, Black holes as mirrors: quan- tum information in random subsystems, Journal of High Energy Physics2007, 120 (2007)

  45. [45]

    X. Mi, P. Roushan, C. Quintana, S. Mandrà, J. Mar- shall, C. Neill, F. Arute, K. Arya, J. Atalaya, R. Bab- bush, J. C. Bardin, R. Barends, J. Basso, A. Bengts- son, S. Boixo, A. Bourassa, M. Broughton, B. B. Buck- ley, D. A. Buell, B. Burkett, N. Bushnell, Z. Chen, B. Chiaro, R. Collins, W. Courtney, S. Demura, A. R. Derk, A. Dunsworth, D. Eppens, C. Eric...

  46. [46]

    Y.-Q. Chen, S. Liu, and S.-X. Zhang, Subsystem infor- mation capacity in random circuits and hamiltonian dy- namics, Quantum9, 1783 (2025)

  47. [47]

    M. A. Nielsen and I. L. Chuang,Quantum computation and quantum information(Cambridge university press, 2010)

  48. [48]

    A. G. Fowler, M. Mariantoni, J. M. Martinis, and A. N. Cleland, Surface codes: Towards practical large-scale quantum computation, Physical Review A86, 032324 (2012)

  49. [49]

    Romero, J

    J. Romero, J. P. Olson, and A. Aspuru-Guzik, Quantum autoencoders for efficient compression of quantum data, Quantum Science and Technology2, 045001 (2017)

  50. [50]

    The information bottleneck method

    N. Tishby, F. C. Pereira, and W. Bialek, The information bottleneck method, arXiv preprint arXiv:physics/0004057 (2000)

  51. [51]

    Goldfeld and Y

    Z. Goldfeld and Y. Polyanskiy, The information bottle- neck problem and its applications in machine learning, IEEE Journal on Selected Areas in Information Theory 1, 19 (2020)

  52. [52]

    M. Caro, T. Gur, C. Rouzé, D. Stilck França, and S. Subramanian, Information-theoretic generalization bounds for learning from quantum data, arXiv preprint arXiv:2311.05529 (2023)

  53. [53]

    Preskill, Quantum computing in the NISQ era and beyond, Quantum2, 79 (2018)

    J. Preskill, Quantum computing in the NISQ era and beyond, Quantum2, 79 (2018)

  54. [54]

    Quantum Subliminal Learning

    B. A. Shaw and T. A. Brun, Quantum steganography with noisy quantum channels, Physical Review A83, 022310 (2011). 8 Supplemental Material for “Quantum Subliminal Learning” S1. SUPPLEMENT AR Y T ASK-CHANNEL TRENDS Fig. S1 collects the task-channel sweeps that support the regime interpretation used in the main text. The classical and quantum teacher-learnin...