Quantum Subliminal Learning
Machine learning models can inherit hidden behavioral traits through innocuous public interfaces, a phenomenon known as subliminal learning. Here we extend this framework to quantum models and study two distillation pathways: an auxiliary channel on random inputs and a restricted task channel in which the student matches a public supervised output while the hidden behavior resides on a disjoint task. Both classical and quantum neural networks (QNNs) exhibit efficient auxiliary-channel subliminal learning, but the task channel shows strong architecture dependence. Classical neural networks transmit little hidden-task information through the public-task interface, whereas QNNs retain most of the hidden-task signal. We show that a unified geometric picture explains both regimes: transmission is controlled by the teacher drift magnitude together with the fraction of hidden-task-relevant drift that remains visible through the public interface. These results identify a concrete security concern for quantum model supply chains and suggest a controlled route for hidden-information transfer in quantum information processing.
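The geometric picture in the abstract can be illustrated with a toy linear model. The sketch below is not the paper's construction: it assumes a student that starts from the teacher's initialization and fits only the public outputs by gradient descent, so in a linear model it recovers the projection of the teacher drift onto the row space of the public-interface Jacobian. The names `J_pub`, `g_hidden`, `drift`, `visible_fraction`, and `hidden_transfer` are hypothetical and chosen for illustration only.

```python
import numpy as np

def _row_space_basis(J_pub, tol=1e-10):
    """Orthonormal basis (as rows) for the row space of J_pub."""
    _, s, Vt = np.linalg.svd(J_pub, full_matrices=False)
    rank = int(np.sum(s > tol * s.max())) if s.size else 0
    return Vt[:rank]

def visible_fraction(J_pub, g_hidden):
    """Share of the squared norm of the hidden-task direction g_hidden
    that lies in the subspace seen through the public interface."""
    basis = _row_space_basis(J_pub)
    proj = basis.T @ (basis @ g_hidden)
    return float(proj @ proj / (g_hidden @ g_hidden))

def hidden_transfer(drift, J_pub, g_hidden):
    """Hidden-task change for the teacher and for a student that only
    matches the public outputs (assumed linear, minimum-norm fit)."""
    basis = _row_space_basis(J_pub)
    student_drift = basis.T @ (basis @ drift)  # visible part of the drift
    return float(g_hidden @ drift), float(g_hidden @ student_drift)

# Toy numbers (assumed, not from the paper): three parameters, of which
# the public interface sees only the first two.
J_pub = np.array([[1.0, 0.0, 0.0],
                  [0.0, 1.0, 0.0]])
g_hidden = np.array([1.0, 0.0, 1.0])   # hidden-task sensitivity
drift = np.array([2.0, 0.0, 3.0])      # teacher drift from initialization

frac = visible_fraction(J_pub, g_hidden)
teacher_change, student_change = hidden_transfer(drift, J_pub, g_hidden)
```

In this toy setting the student inherits only the part of the hidden-task change carried by the visible drift. Scaling `drift` scales both quantities, which mirrors the abstract's statement that transmission depends on the drift magnitude together with the visible fraction.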