Reinforcement Learning Control of Quantum Error Correction

Aaron Lunt; Aaron Shorter; Aaron Szasz; Abeer Vaishnav; Adam Zalcman; Aditya Locharla; Agnetta Y. Cleland; Agustin Di Paolo; Alan Wing Lun Fung; Alec Eickbusch

arxiv: 2511.08493 · v4 · pith:6ZRUOO5Anew · submitted 2025-11-11 · 🪐 quant-ph

Reinforcement Learning Control of Quantum Error Correction

Volodymyr Sivak , Alexis Morvan , Michael Broughton , Rodrigo G. Corti\~nas , Johannes Bausch , Andrew W. Senior , Matthew Neeley , Alec Eickbusch

show 291 more authors

Noah Shutty Laleh Aghababaie Beni James S. Spencer Francisco J. H Heras Thomas Edlich Dmitry Abanin Amira Abbas Rajeev Acharya Georg Aigeldinger Ross Alcaraz Sayra Alcaraz Trond I. Andersen Markus Ansmann Frank Arute Kunal Arya Walt Askew Nikita Astrakhantsev Juan Atalaya Brian Ballard Joseph C. Bardin Hector Bates Andreas Bengtsson Majid Bigdeli Karimi Alexander Bilmes Simon Bilodeau Felix Borjans Alexandre Bourassa Jenna Bovaird Dylan Bowers Leon Brill Peter Brooks David A. Browne Brett Buchea Bob B. Buckley Tim Burger Brian Burkett Nicholas Bushnell Jamal Busnaina Anthony Cabrera Juan Campero Hung-Shen Chang Silas Chen Ben Chiaro Liang-Ying Chih Agnetta Y. Cleland Bryan Cochrane Matt Cockrell Josh Cogan Roberto Collins Paul Conner Harold Cook William Courtney Alexander L. Crook Ben Curtin Martin Damyanov Sayan Das Dripto M. Debroy Sean Demura Paul Donohoe Ilya Drozdov Andrew Dunsworth Valerie Ehimhen Aviv Moshe Elbag Lior Ella Mahmoud Elzouka David Enriquez Catherine Erickson Vinicius S. Ferreira Marcos Flores Leslie Flores Burgos Ebrahim Forati Jeremiah Ford Austin G. Fowler Brooks Foxen Masaya Fukami Alan Wing Lun Fung Lenny Fuste Suhas Ganjam Gonzalo Garcia Christopher Garrick Robert Gasca Helge Gehring Robert Geiger \'Elie Genois William Giang Dar Gilboa James E. Goeders Edward C. Gonzales Raja Gosula Stijn J. de Graaf Alejandro Grajales Dau Dietrich Graumann Joel Grebel Alex Greene Jonathan A. Gross Jose Guerrero Lo\"ick Le Guevel Tan Ha Steve Habegger Tanner Hadick Ali Hadjikhani Michael C. Hamilton Matthew P. Harrigan Sean D. Harrington Jeanne Hartshorn Stephen Heslin Paula Heu Oscar Higgott Reno Hiltermann Hsin-Yuan Huang Mike Hucka Christopher Hudspeth Ashley Huff William J. Huggins Evan Jeffrey Shaun Jevons Zhang Jiang Xiaoxuan Jin Chaitali Joshi Pavol Juhas Andreas Kabel Dvir Kafri Hui Kang Kiseo Kang Amir H. Karamlou Ryan Kaufman Kostyantyn Kechedzhi Tanuj Khattar Mostafa Khezri Seon Kim Can M. Knaut Bryce Kobrin Fedor Kostritsa John Mark Kreikebaum Ryuho Kudo Ben Kueffler Arun Kumar Vladislav D. Kurilovich Vitali Kutsko Nathan Lacroix David Landhuis Tiano Lange-Dei Brandon W. Langley Pavel Laptev Kim-Ming Lau Justin Ledford Joy Lee Kenny Lee Brian J. Lester Wendy Leung Lily Li Wing Yan Li Ming Li Alexander T. Lill William P. Livingston Matthew T. Lloyd Aditya Locharla Laura De Lorenzo Daniel Lundahl Aaron Lunt Sid Madhuk Aniket Maiti Ashley Maloney Salvatore Mandr\`a Leigh S. Martin Orion Martin Eric Mascot Paul Masih Das Dmitri Maslov Melvin Mathews Cameron Maxfield Jarrod R. McClean Matt McEwen Seneca Meeks Kevin C. Miao Zlatko K. Minev Reza Molavi Sebastian Molina Shirin Montazeri Charles Neill Michael Newman Anthony Nguyen Murray Nguyen Chia-Hung Ni Murphy Yuezhen Niu Logan Oas Raymond Orosco Kristoffer Ottosson Alice Pagano Agustin Di Paolo Sherman Peek David Peterson Alex Pizzuto Elias Portoles Rebecca Potter Orion Pritchard Michael Qian Chris Quintana Arpit Ranadive Matthew J. Reagor Rachel Resnick David M. Rhodes Daniel Riley Gabrielle Roberts Roberto Rodriguez Emma Ropes Lucia B. De Rose Eliott Rosenberg Emma Rosenfeld Dario Rosenstock Elizabeth Rossi Pedram Roushan David A. Rower Robert Salazar Kannan Sankaragomathi Murat Can Sarihan Kevin J. Satzinger Max Schaefer Sebastian Schroeder Henry F. Schurkus Aria Shahingohar Michael J. Shearn Aaron Shorter Vladimir Shvarts Spencer Small W. Clarke Smith David A. Sobel Barrett Spells Sofia Springer George Sterling Jordan Suchard Aaron Szasz Alexander Sztein Madeline Taylor Jothi Priyanka Thiruraman Douglas Thor Dogan Timucin Eifu Tomita Alfredo Torres M. Mert Torunbalci Hao Tran Abeer Vaishnav Justin Vargas Sergey Vdovichev Guifre Vidal Catherine Vollgraff Heidweiller Meghan Voorhees Steven Waltman Jonathan Waltz Shannon X. Wang Brayden Ware James D. Watson Yonghua Wei Travis Weidel Theodore White Kristi Wong Bryan W. K. Woo Christopher J. Wood Maddy Woodson Cheng Xing Z. Jamie Yao Ping Yeh Bicheng Ying Juhwan Yoo Noureldin Yosri Elliot Young Grayson Young Adam Zalcman Ran Zhang Yaxing Zhang Ningfeng Zhu Nicholas Zobrist Zhenjie Zou Ryan Babbush Dave Bacon Sergio Boixo Yu Chen Zijun Chen Michel Devoret Monica Hansen Jeremy Hilton Cody Jones Julian Kelly Alexander N. Korotkov Erik Lucero Anthony Megrant Hartmut Neven William D. Oliver Ganesh Ramachandran Vadim Smelyanskiy Paul V. Klimov

This is my paper

classification 🪐 quant-ph

keywords quantumcontrolerrorcomputationcomputerlearninglogicalparameters

0 comments

read the original abstract

Quantum error correction (QEC) is the primary strategy for protecting a quantum computer from the environment. Its prerequisite is that errors must remain sufficiently rare, which requires perpetually adapting the computer's control parameters to the drifting environment conditions. The current solution to this problem is to terminate the entire quantum computation for recalibration, but it is incompatible with the long runtimes of future quantum algorithms. We address this challenge by unifying calibration with computation. We grant the QEC process a dual role: its error detection events are not only used to correct the logical quantum state, but are also repurposed as a learning signal, teaching a reinforcement learning (RL) agent to continuously steer the control parameters and stabilize the quantum system during computation. We experimentally demonstrate this framework on a Willow superconducting processor, improving the logical stability of the surface code 3.5-fold against injected drift. By synthesizing our full suite of technological advances, we achieve record performance of the surface and color codes, with average logical error per cycle of $7.72(9)\times10^{-4}$ and $8.19(14)\times10^{-3}$ respectively. Numerical simulations of large codes with tens of thousands of control parameters confirm the scalability of our RL framework, revealing an optimization speed that is independent of system size. This work thus enables a new paradigm: a quantum computer that learns from its errors and never stops computing.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 5 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Classical State Preparation for Variational Quantum Algorithms via Reinforcement Learning
quant-ph 2026-05 unverdicted novelty 7.0

CRiSP uses neural-guided MCTS and curriculum learning to insert Clifford prefixes before parameterized rotations in VQAs, yielding mean 3.17x and max 45x gains in energy accuracy on 22-qubit QAOA benchmarks versus pri...
FTPrimitiveBench: A Benchmark Suite For Logical Computation Under Hardware-Motivated and Biased Noise Models
quant-ph 2026-05 accept novelty 6.0

FTPrimitiveBench is a new benchmark suite for testing surface-code logical primitives under Pauli-biased, measurement-biased, and spatially non-uniform noise models, revealing that noise structure interacts distinctly...
High-fidelity entangling gates and nonlocal circuits with neutral atoms
quant-ph 2026-04 conditional novelty 6.0

Neutral-atom system delivers state-of-the-art CZ gate fidelity of 99.854% (99.941% postselected) and demonstrates coherent rearrangement for nonlocal quantum circuits.
FTPrimitiveBench: A Benchmark Suite For Logical Computation Under Hardware-Motivated and Biased Noise Models
quant-ph 2026-05 conditional novelty 5.0

FTPrimitiveBench is an open-source pipeline that connects parameterized hardware-motivated noise models to surface-code logical primitive circuits, enabling reproducible cross-primitive QEC benchmarking under Pauli bi...
High-Coherence and High-frequency Quantum Computing: The Design of a High-Frequency, High-Coherence and Scalable Quantum Computing Architecture
quant-ph 2026-01 unverdicted novelty 4.0

The paper proposes an 8-qubit transmon design at 12 GHz targeting 1.9 ms relaxation times and quality factors of 2.75e7 via tantalum and Nb/Al/AlOx fabrication on silicon.