A multi-task SAC RL model discovers control pulses, evolution time, and segment numbers for 51 open quantum Hamiltonians, achieving high fidelity state transfer with better robustness to noise than GRAPE.
When Does Adaptation Win? Scaling Laws for Meta-Learning in Quantum Control
1 Pith paper cite this work. Polarity classification is still indexing.
abstract
Quantum hardware suffers from intrinsic device heterogeneity and environmental drift, forcing practitioners to choose between suboptimal non-adaptive controllers or costly per-device recalibration. We derive a scaling law lower bound for meta-learning showing that the adaptation gain (expected fidelity improvement from task-specific gradient steps) saturates exponentially with gradient steps and scales linearly with task variance, providing a quantitative criterion for when adaptation justifies its overhead. Validation on quantum gate calibration shows negligible benefits for low-variance tasks but >40% fidelity gains on two-qubit gates under extreme out-of-distribution conditions (10$\times$ the training noise), with implications for reducing per-device calibration time on cloud quantum processors. Further validation on classical linear-quadratic control confirms these laws emerge from general optimization geometry rather than quantum-specific physics. We further introduce a few-shot pre-adaptation protocol that estimates the optimal adaptation budget from $N{=}3$-5 probe steps within 3-19% relative error across out-of-distribution regimes.
fields
quant-ph 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Adaptive Reinforcement Learning for Robust Open Quantum System Control: A Multi-Task Framework with Temporal Optimization
A multi-task SAC RL model discovers control pulses, evolution time, and segment numbers for 51 open quantum Hamiltonians, achieving high fidelity state transfer with better robustness to noise than GRAPE.