QKAN: quantum Kolmogorov-Arnold networks with applications in machine learning and multivariate state preparation
Pith reviewed 2026-05-23 20:11 UTC · model grok-4.3
The pith
QKAN constructs wide-and-shallow quantum networks from recursive block-encodings to support both learning models and multivariate state preparation.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
QKAN is a quantum algorithmic framework based on block-encodings constructed recursively from a single layer using quantum singular value transformation. It functions as a learning model by taking eigenvalues of block-encoded matrices as neurons and placing parametrized activation functions on network edges. The same structure serves as a multivariate quantum state-preparation protocol for functions with shallow compositional structure, demonstrated by preparing a multivariate Gaussian state in two layers. The architecture is wide-and-shallow, with shallow depth compensated by exponentially wide layers when efficient block-encodings of inputs are available. Parametrization and training occur
What carries the argument
Block-encodings constructed recursively from a single layer using quantum singular value transformation, which carries the compositional structure and enables the wide-and-shallow scaling.
If this is right
- QKAN serves as a quantum learning model whose training uses parametrized quantum circuits together with quantum linear algebra subroutines.
- A two-layer QKAN prepares a multivariate Gaussian quantum state with resources set by the compositional depth rather than the input dimension.
- Shallow depth is offset by exponentially wide layers precisely when efficient block-encodings exist for the inputs.
- The modular, compositional design opens new applications in quantum machine learning and quantum state preparation.
Where Pith is reading between the lines
- If block-encodings prove efficient for broad classes of functions, QKAN could reduce the circuit depth required for approximation tasks compared with standard variational approaches.
- The recursive block-encoding construction may combine with other quantum algorithms that already rely on block-encodings for linear-algebra primitives.
- Hybrid training loops could alternate between classical optimization of the activation parameters and quantum evaluation of the block-encoded layers.
Load-bearing premise
Efficient block-encodings of the input matrices or functions are available so that exponential width can compensate for shallow depth.
What would settle it
Implement the two-layer QKAN for multivariate Gaussian state preparation on a quantum device and check whether the output state fidelity reaches the target with circuit depth and width scaling as claimed.
Figures
read the original abstract
We introduce quantum Kolmogorov-Arnold networks (QKAN), a quantum algorithmic framework inspired by the recently proposed Kolmogorov-Arnold Networks (KAN). QKAN inherits the compositional structure of KAN and is based on block-encodings, constructed recursively from a single layer using quantum singular value transformation. We demonstrate the algorithmic utility of QKAN in two applications. First, we introduce and analyze QKAN as a quantum learning model, treating the eigenvalues of block-encoded matrices as neurons and applying parametrized activation functions on the edges of the network. We show that QKAN is a wide-and-shallow neural architecture, where shallow depth is compensated by exponentially wide layers whenever efficient block-encodings of inputs are available. We further discuss how to parametrize and train QKAN using parametrized quantum circuits and quantum linear algebra subroutines. Second, we demonstrate that QKAN can serve as a multivariate quantum state-preparation protocol for functions with shallow compositional structure. We demonstrate this by efficiently preparing a multivariate Gaussian quantum state using a two-layer QKAN. Looking forward, we anticipate that QKAN's compositional and modular design will enable new applications in quantum machine learning and quantum state preparation.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces quantum Kolmogorov-Arnold networks (QKAN), a framework that inherits the compositional structure of classical KANs and constructs them recursively from block-encodings via quantum singular value transformation (QSVT). It presents QKAN as a quantum learning model in which eigenvalues of block-encoded matrices act as neurons with parametrized activation functions on the edges, and demonstrates its use for multivariate quantum state preparation via a two-layer construction that prepares a Gaussian state. The central claim is that QKAN yields a wide-and-shallow architecture in which exponentially wide layers compensate for shallow depth whenever efficient block-encodings of the inputs are available; the manuscript discusses parametrization via parametrized quantum circuits and quantum linear-algebra subroutines.
Significance. If the claims hold, QKAN would supply a modular, compositional quantum architecture that directly transplants the edge-function flexibility of KANs into the quantum setting, potentially enabling new quantum-ML models and state-preparation protocols that exploit shallow compositional structure. The explicit two-layer Gaussian preparation constitutes a concrete, falsifiable example that can be checked for resource scaling; the recursive QSVT construction itself is a reusable primitive.
major comments (2)
- [Abstract and §4] Abstract and §4 (quantum learning model): the headline claim that 'shallow depth is compensated by exponentially wide layers whenever efficient block-encodings of inputs are available' is load-bearing for the wide-and-shallow characterization, yet the text invokes the premise for both the eigenvalue-neuron model and the state-preparation protocol without supplying a general construction or poly(log N) resource bound that would establish when such block-encodings exist for arbitrary input matrices or compositional functions.
- [§5.2] §5.2 (multivariate Gaussian state preparation): the two-layer QKAN is asserted to prepare the state 'efficiently,' but no explicit query-complexity or gate-count analysis is given that would confirm the exponential-width compensation actually materializes beyond the assumption of an efficient block-encoding; this leaves the concrete demonstration without a verifiable resource comparison to standard state-preparation methods.
minor comments (2)
- [§3] Notation for the recursive QSVT construction (likely §3) would benefit from an explicit diagram or pseudocode showing how a single-layer block-encoding is lifted to the multi-layer network; the current prose description leaves the width scaling implicit.
- [Introduction] The manuscript should cite the original KAN paper (Liu et al., 2024) and recent block-encoding/QSVT surveys in the introduction to situate the contribution.
Simulated Author's Rebuttal
We thank the referee for the constructive review and for highlighting the importance of clarifying the assumptions underlying our claims. We address each major comment below and will revise the manuscript accordingly to strengthen the presentation.
read point-by-point responses
-
Referee: [Abstract and §4] Abstract and §4 (quantum learning model): the headline claim that 'shallow depth is compensated by exponentially wide layers whenever efficient block-encodings of inputs are available' is load-bearing for the wide-and-shallow characterization, yet the text invokes the premise for both the eigenvalue-neuron model and the state-preparation protocol without supplying a general construction or poly(log N) resource bound that would establish when such block-encodings exist for arbitrary input matrices or compositional functions.
Authors: We agree that no general construction or poly(log N) bound is supplied for arbitrary input matrices, as the existence of efficient block-encodings is inherently problem-dependent and relies on specific structure (e.g., sparsity or oracle access). The manuscript's claim is explicitly conditional on such encodings being available, consistent with standard assumptions in QSVT and quantum linear algebra. We will revise the abstract and §4 to emphasize this conditionality, note that a universal construction lies outside the paper's scope, and clarify that the exponential width arises from the block-encoding dimension when the premise holds. revision: partial
-
Referee: [§5.2] §5.2 (multivariate Gaussian state preparation): the two-layer QKAN is asserted to prepare the state 'efficiently,' but no explicit query-complexity or gate-count analysis is given that would confirm the exponential-width compensation actually materializes beyond the assumption of an efficient block-encoding; this leaves the concrete demonstration without a verifiable resource comparison to standard state-preparation methods.
Authors: The efficiency claim for the Gaussian example rests on the assumption of an efficient block-encoding, without a detailed query or gate-count analysis in the current text. We will add an explicit discussion in §5.2 outlining the resource assumptions (including how the block-encoding dimension contributes to width) and a high-level comparison to standard state-preparation approaches, while noting that concrete numbers depend on the specific block-encoding implementation. revision: yes
Circularity Check
No significant circularity; derivation relies on external block-encoding premise and known QSVT.
full rationale
The paper defines QKAN via recursive construction from block-encodings using QSVT, inheriting KAN's compositional structure without reducing any target quantity (e.g., width compensation or state-preparation efficiency) to a fitted parameter or self-citation by definition. The 'wide-and-shallow' claim is explicitly conditioned on the external availability of efficient block-encodings rather than derived tautologically. No self-citations appear load-bearing, no ansatz is smuggled, and the Gaussian demonstration is presented as an explicit two-layer protocol without renaming or circular prediction. The chain is self-contained against the stated assumptions.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Efficient block-encodings of inputs are available
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Theorem 1 (CHEB-QKAN, informal). ... trainable activation functions are applied on edges ... linear combination of Chebyshev polynomials up to degree d ... O(d²L/2^L) applications of controlled block-encodings
-
IndisputableMonolith/Foundation/DimensionForcing.leanalexander_duality_circle_linking unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
QKAN is a wide-and-shallow neural architecture, where shallow depth is compensated by exponentially wide layers whenever efficient block-encodings of inputs are available
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Forward citations
Cited by 8 Pith papers
-
Blind Catalytic Quantum Error Correction: Target-State Estimation and Fidelity Recovery Without A Priori Knowledge
Blind CQEC estimates target states from noisy outputs alone to recover quantum fidelity, achieving over 0.95 fidelity for small dimensions without noise-model knowledge via coherence maximization.
-
Blind Catalytic Quantum Error Correction: Target-State Estimation and Fidelity Recovery Without A Priori Knowledge
Blind CQEC estimates the target state from noisy data to recover fidelity without a priori knowledge, with a proven Lipschitz bound explaining linear correlation in fidelities and 3.4x error reduction in H2 VQE.
-
Merged amplitude encoding for Chebyshev quantum Kolmogorov--Arnold networks: trading qubits for circuit executions
Merged amplitude encoding reduces circuit executions in CCQKAN by a factor of n with 1-2 extra qubits and preserves trainability in numerical tests on networks and MNIST classification.
-
Quantum circuit complexity and unsupervised machine learning of topological order
Nielsen quantum circuit complexity is positioned as a topological distance for unsupervised learning of topological order, with theorems linking it to Bures distance and entanglement to yield practical fidelity- and e...
-
Catalytic Quantum Error Correction: Theory, Efficient Catalyst Preparation, and Numerical Benchmarks
Catalytic Quantum Error Correction recovers known target states from noisy copies with F > 0.96 using only eight copies by preserving coherent modes and applying a CPMG-Clifford-swap-test pipeline, bypassing magnitude...
-
The $\gamma_c$-Peak: Covariant Recovery on Four Organic Qubit Platforms
Four organic-material paths are outlined for magnetic-field-free quantum computing, with simulator benchmarks claiming significant error-correction gains and provable advantages over classical methods in toy-scale tasks.
-
The $\gamma_c$-Peak: Covariant Recovery on Four Organic Qubit Platforms
Simulations across four organic platforms show peak fidelity gains from covariant Petz recovery exactly at the entanglement-breaking threshold, with linear scaling in log d and significant advantages for several quant...
-
The $\gamma_c$-Peak: Covariant Recovery on Four Organic Qubit Platforms
Simulations across four organic qubit platforms show Petz recovery yields maximum fidelity gain at the entanglement-breaking threshold gamma_c, with Delta F max of 0.303 at dimension 64 and log2 d scaling.
Reference graph
Works this paper leans on
-
[1]
A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, in Advances in Neural Information Processing Systems , Vol. 25, edited by F. Pereira, C. Burges, L. Bottou, and 18 K. Weinberger (Curran Associates, Inc., 2012)
work page 2012
-
[2]
I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning (MIT Press, 2016)
work page 2016
- [3]
-
[4]
K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition, in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016) pp. 770–778
work page 2016
-
[5]
Cybenko, Approximation by superpositions of a sigmoidal function, Math
G. Cybenko, Approximation by superpositions of a sigmoidal function, Math. Control Signals Syst. 2, 303 (1989)
work page 1989
- [6]
-
[7]
Z. Liu, Y. Wang, S. Vaidya, F. Ruehle, J. Halverson, M. Soljaˇ ci´ c, T. Y. Hou, and M. Tegmark, KAN: Kolmogorov-Arnold networks (2024), arXiv:2404.19756 [cs.LG]
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[8]
A. Kolmogorov, On the representation of continuous functions of several variables by superpositions of contin- uous functions of a smaller number of variables, Dokl. Akad. Nauk SSSR 108, 179 (1956)
work page 1956
-
[9]
A. Kolmogorov, On the representation of continuous functions of many variables by superposition of continuous functions of one variable and addition, Dokl. Akad. Nauk SSSR 114, 953 (1957)
work page 1957
-
[10]
Arnold, On functions of three variables, Dokl
V. Arnold, On functions of three variables, Dokl. Akad. Nauk SSSR 114, 679 (1957)
work page 1957
-
[11]
V. Arnold, On the representation of continuous functions of three variables by superpositions of continuous functions of two variables, Mat. Sb. (N.S.) 48(90), 3 (1959)
work page 1959
- [12]
- [13]
-
[14]
M. Kiamari, M. Kiamari, and B. Krishnamachari, GKAN: Graph Kolmogorov-Arnold networks (2024), arXiv:2406.06470 [cs.LG]
- [15]
-
[16]
S. Sidharth, A. Keerthana, R. Gokul, and K. Anas, Chebyshev polynomial-based Kolmogorov-Arnold networks: An efficient architecture for nonlinear function approximation (2024), arXiv:2405.07200 [cs.LG]
-
[17]
Z. Bozorgasl and H. Chen, Wav-KAN: Wavelet Kolmogorov-Arnold networks (2024), arXiv:2405.12832 [cs.LG]
-
[18]
R. Genet and H. Inzirillo, A temporal Kolmogorov-Arnold Transformer for time series forecasting (2024), arXiv:2406.02486 [cs.LG]
-
[19]
Y. Wang, J. Sun, J. Bai, C. Anitescu, M. S. Eshaghi, X. Zhuang, T. Rabczuk, and Y. Liu, Kolmogorov Arnold Informed neural network: A physics-informed deep learning framework for solving forward and inverse problems based on Kolmogorov Arnold networks (2024), arXiv:2406.11045 [cs.LG]
- [20]
- [21]
- [22]
-
[23]
F. Zhang and X. Zhang, GraphKAN: Enhancing feature extraction with graph Kolmogorov Arnold networks (2024), arXiv:2406.13597 [cs.LG]
-
[24]
Preskill, Quantum computing in the NISQ era and beyond, Quantum 2, 79 (2018)
J. Preskill, Quantum computing in the NISQ era and beyond, Quantum 2, 79 (2018)
work page 2018
-
[25]
M. Schuld and F. Petruccione, Machine Learning with Quantum Computers (Springer Cham, 2021)
work page 2021
-
[26]
J. Biamonte, P. Wittek, N. Pancotti, P. Rebentrost, N. Wiebe, and S. Lloyd, Quantum machine learning, Nature 549, 195 (2016)
work page 2016
-
[27]
K. Beer, D. Bondarenko, T. Farrelly, T. J. Osborne, R. Salzmann, D. Scheiermann, and R. Wolf, Training deep quantum neural networks, Nat. Commun. 11, 808 (2020)
work page 2020
- [28]
-
[29]
V. Havl´ ıˇ cek, A. D. C´ orcoles, K. Temme, A. W. Harrow, A. Kandala, J. M. Chow, and J. M. Gambetta, Supervised learning with quantum-enhanced feature spaces, Nature 567, 209–212 (2019)
work page 2019
-
[30]
K. Mitarai, M. Negoro, M. Kitagawa, and K. Fujii, Quantum circuit learning, Phys. Rev. A 98, 032309 (2018)
work page 2018
-
[31]
Classification with Quantum Neural Networks on Near Term Processors
E. Farhi and H. Neven, Classification with quantum neural networks on near term processors (2018), arXiv:1802.06002 [quant-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[32]
M. Benedetti, E. Lloyd, S. Sack, and M. Fiorentini, Parameterized quantum circuits as machine learning models, Quantum Sci. Technol. 4, 043001 (2019)
work page 2019
-
[33]
M. Schuld and N. Killoran, Quantum machine learning in feature Hilbert spaces, Phys. Rev. Lett. 122, 040504 (2019). 19
work page 2019
- [34]
- [35]
- [36]
- [37]
- [38]
-
[39]
M. C. Caro, H.-Y. Huang, M. Cerezo, K. Sharma, A. Sornborger, L. Cincio, and P. J. Coles, Generalization in quantum machine learning from few training data, Nat. Commun. 13 (2022)
work page 2022
-
[40]
L. Pira and C. Ferrie, On the interpretability of quantum neural networks, Quantum Mach. Intell. 6 (2024)
work page 2024
-
[41]
J. R. McClean, S. Boixo, V. N. Smelyanskiy, R. Babbush, and H. Neven, Barren plateaus in quantum neural network training landscapes, Nat. Commun. 9 (2018)
work page 2018
- [42]
-
[43]
C. Ortiz Marrero, M. Kieferov´ a, and N. Wiebe, Entanglement-induced barren plateaus, PRX Quantum 2, 040316 (2021)
work page 2021
-
[44]
S. Wang, E. Fontana, M. Cerezo, K. Sharma, A. Sone, L. Cincio, and P. J. Coles, Noise-induced barren plateaus in variational quantum algorithms, Nat. Commun. 12 (2021)
work page 2021
- [45]
-
[46]
M. Cerezo, M. Larocca, D. Garc´ ıa-Mart´ ın, N. L. Diaz, P. Braccia, E. Fontana, M. S. Rudolph, P. Bermejo, A. Ijaz, S. Thanasilp, E. R. Anschuetz, and Z. Holmes, Does provable absence of barren plateaus imply classical simulability? or, why we need to rethink variational quantum computing (2023), arXiv:2312.09121 [quant-ph]
-
[47]
M. Larocca, S. Thanasilp, S. Wang, K. Sharma, J. Biamonte, P. J. Coles, L. Cincio, J. R. McClean, Z. Holmes, and M. Cerezo, A review of barren plateaus in variational quantum computing (2024), arXiv:2405.00781 [quant- ph]
-
[48]
P. Rebentrost, M. Mohseni, and S. Lloyd, Quantum support vector machine for big data classification, Phys. Rev. Lett. 113, 130503 (2014)
work page 2014
-
[49]
I. Kerenidis, J. Landman, and A. Prakash, Quantum algorithms for deep convolutional neural networks, in International Conference on Learning Representations (2020)
work page 2020
- [50]
-
[51]
J. Allcock, C.-Y. Hsieh, I. Kerenidis, and S. Zhang, Quantum algorithms for feedforward neural networks, ACM Trans. Quantum Comput. 1 (2020)
work page 2020
-
[52]
P. Rebentrost, T. R. Bromley, C. Weedbrook, and S. Lloyd, Quantum Hopfield neural network, Phys. Rev. A 98, 042308 (2018)
work page 2018
-
[53]
M. H. Amin, E. Andriyash, J. Rolfe, B. Kulchytskyy, and R. Melko, Quantum Boltzmann machine, Phys. Rev. X 8, 021050 (2018)
work page 2018
- [54]
-
[55]
G. H. Low and I. L. Chuang, Optimal Hamiltonian simulation by quantum signal processing, Phys. Rev. Lett. 118, 010501 (2017)
work page 2017
-
[56]
G. H. Low and I. L. Chuang, Hamiltonian simulation by qubitization, Quantum 3, 163 (2019)
work page 2019
-
[57]
A. Gily´ en, Y. Su, G. H. Low, and N. Wiebe, Quantum singular value transformation and beyond: exponential improvements for quantum matrix arithmetics, in Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing , STOC 2019 (Association for Computing Machinery, New York, NY, USA, 2019) pp. 193–204
work page 2019
-
[58]
P. Rall and B. Fuller, Amplitude estimation from quantum signal processing, Quantum 7, 937 (2023)
work page 2023
-
[59]
J. M. Martyn, Z. M. Rossi, A. K. Tan, and I. L. Chuang, Grand unification of quantum algorithms, PRX Quantum 2, 040203 (2021)
work page 2021
-
[60]
A. Gily´ en and A. Poremba, Improved quantum algorithms for fidelity estimation (2022), arXiv:2203.15993 [quant-ph]
-
[61]
Q. Wang, Z. Zhang, K. Chen, J. Guan, W. Fang, J. Liu, and M. Ying, Quantum algorithm for fidelity estimation, IEEE Trans. Inf. Theor. 69, 273 (2023). 20
work page 2023
-
[62]
Q. Wang, J. Guan, J. Liu, Z. Zhang, and M. Ying, New quantum algorithms for computing quantum entropies and distances, IEEE Trans. Inf. Theor. 70, 5653 (2024)
work page 2024
- [63]
- [64]
-
[65]
J. Braun and M. Griebel, On a constructive proof of Kolmogorov’s superposition theorem, Constr. Approx. 30, 653 (2009)
work page 2009
-
[66]
F. Girosi and T. Poggio, Representation properties of networks: Kolmogorov’s theorem is irrelevant, Neural Comput. 1, 465 (1989)
work page 1989
-
[67]
Akashi, Application of ϵ-entropy theory to Kolmogorov-Arnold representation theorem, Rep
S. Akashi, Application of ϵ-entropy theory to Kolmogorov-Arnold representation theorem, Rep. Math. Phys. 48, 19 (2001)
work page 2001
- [68]
- [69]
-
[70]
S. Chakraborty, A. Gily´ en, and S. Jeffery, The power of block-encoded matrix powers: Improved regression techniques via faster Hamiltonian simulation, in 46th International Colloquium on Automata, Languages, and Programming (ICALP 2019) , Leibniz International Proceedings in Informatics (LIPIcs), Vol. 132, edited by C. Baier, I. Chatzigiannakis, P. Floc...
work page 2019
-
[71]
A. M. Dalzell, S. McArdle, M. Berta, P. Bienias, C.-F. Chen, A. Gily´ en, C. T. Hann, M. J. Kastoryano, E. T. Khabiboulline, A. Kubica, G. Salton, S. Wang, and F. G. S. L. Brand˜ ao, Quantum algorithms: A survey of applications and end-to-end complexities (2023), arXiv:2310.03011 [quant-ph]
- [72]
-
[73]
D. W. Berry, A. M. Childs, R. Cleve, R. Kothari, and R. D. Somma, Simulating Hamiltonian dynamics with a truncated Taylor series, Phys. Rev. Lett. 114, 090502 (2015)
work page 2015
-
[74]
G. H. Low and I. L. Chuang, Hamiltonian simulation by uniform spectral amplification (2017), arXiv:1707.05391 [quant-ph]
work page internal anchor Pith review Pith/arXiv arXiv 2017
- [75]
-
[76]
M. A. Nielsen and I. L. Chuang, Quantum Computation and Quantum Information (Cambridge University Press, 2010)
work page 2010
-
[77]
A. W. Harrow, A. Hassidim, and S. Lloyd, Quantum algorithm for linear systems of equations, Phys. Rev. Lett. 103, 150502 (2009)
work page 2009
-
[78]
Aaronson, Read the fine print, Nat
S. Aaronson, Read the fine print, Nat. Phys. 11, 291 (2015)
work page 2015
-
[79]
L. K. Grover, Synthesis of quantum superpositions by quantum computation, Phys. Rev. Lett. 85, 1334 (2000)
work page 2000
-
[80]
M. Plesch and C. Brukner, Quantum-state preparation with universal gate decompositions, Phys. Rev. A 83, 032302 (2011)
work page 2011
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.