Kolmogorov-Arnold Reservoir Computing

Juntian Huang; Jurgen Kurths; Ying Tang

arxiv: 2606.19984 · v3 · pith:3UKF6R2Xnew · submitted 2026-06-18 · 💻 cs.LG

Kolmogorov-Arnold Reservoir Computing

Juntian Huang , Jurgen Kurths , Ying Tang This is my paper

Pith reviewed 2026-06-30 10:49 UTC · model grok-4.3

classification 💻 cs.LG

keywords reservoir computingKolmogorov-Arnold networksdynamical systemsbasis function expansionsforecastingpartial differential equationsmachine learning

0 comments

The pith

Kolmogorov-Arnold Reservoir Computing replaces reservoirs with basis-function expansions to combine KAN expressivity with closed-form reservoir training.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops Kolmogorov-Arnold Reservoir Computing by substituting conventional reservoirs with explicit basis-function expansions drawn from the Kolmogorov-Arnold representation theorem. This substitution is shown to preserve the expressive capacity claimed for Kolmogorov-Arnold networks while enabling the efficient closed-form training that defines reservoir computing. The result addresses limitations of standard reservoir methods in long-range dependencies and of next-generation variants in exploding feature dimensions. If the substitution holds, high-fidelity forecasting of dynamical systems becomes possible at lower cost, with demonstrated gains on partial differential equation benchmarks and compatibility with diffusion models for generation tasks.

Core claim

KARC replaces reservoirs with explicit basis-function expansions inspired by the Kolmogorov-Arnold representation theorem. We rigorously show that KARC is a lightweight design of Kolmogorov-Arnold networks (KANs), preserving the potential expressive capacity of KANs while admitting efficient closed-form training of reservoir computing. At comparable cost, KARC outperforms existing reservoir computing methods on challenging benchmarks including partial differential equations. It can also be integrated with generative diffusion models for text-to-image generation.

What carries the argument

Explicit basis-function expansions inspired by the Kolmogorov-Arnold representation theorem, used in place of recurrent reservoirs to carry the network's representational capacity.

If this is right

At comparable cost, KARC outperforms existing reservoir computing methods on challenging benchmarks including partial differential equations.
It can be integrated with generative diffusion models for text-to-image generation.
This establishes a principled bridge between reservoir computing and KANs, enabling efficient and high-fidelity dynamical system forecasting.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same basis expansions could be tested as drop-in replacements in other recurrent architectures that currently rely on random or trained reservoirs.
Because training remains closed-form, KARC may scale more readily to very long time series than gradient-based KAN variants.
Integration with diffusion models opens the possibility of using KARC as a temporal prior inside generative pipelines for physics-informed image or video synthesis.

Load-bearing premise

The basis-function expansions inspired by the Kolmogorov-Arnold representation theorem can be substituted for conventional reservoirs without loss of the expressive capacity that KANs are claimed to possess.

What would settle it

A benchmark dynamical system on which KARC matches or exceeds full KAN performance in accuracy yet requires substantially more parameters or training time than standard reservoir methods would falsify the efficiency-plus-capacity claim.

Figures

Figures reproduced from arXiv: 2606.19984 by Juntian Huang, Jurgen Kurths, Ying Tang.

**Figure 2.** Figure 2: FIG. 2 [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗

**Figure 3.** Figure 3: FIG. 3 [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

**Figure 3.** Figure 3: FIG. 3 [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗

**Figure 4.** Figure 4: FIG. 4 [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗

**Figure 5.** Figure 5: FIG. 5 [PITH_FULL_IMAGE:figures/full_fig_p009_5.png] view at source ↗

**Figure 5.** Figure 5: FIG. 5 [PITH_FULL_IMAGE:figures/full_fig_p010_5.png] view at source ↗

**Figure 1.** Figure 1: Effect of low-rank readout factorization on KARC forecasting for the Kuramoto-Sivashinsky equation. Forecasting results are shown for the Kuramoto-Sivashinsky equation with domain size L=22, using KARC with Fourier basis functions. The top row shows the ground-truth spatiotemporal field. The following rows show KARC forecasts with different low-rank readout ratios, where ratio=dl/dh, dl is the low-rank re… view at source ↗

**Figure 2.** Figure 2: Forecasting performance of RC with different reservoir sizes on the double-scroll system. Each row corresponds to a different reservoir dimension dh, and each column represents one state variable of the double-scroll system. Blue and gray curves denote the predicted and reference trajectories, respectively. 4 2 0 order = 2 1e6 V1 0 1 2 1e6 V2 0 2 4 1e6 I 2 0 2 order = 3 1 0 1 2 0 2 2 0 2 order = 4 1 0 1 2 … view at source ↗

**Figure 3.** Figure 3: Forecasting performance of NG-RC with different orders on the double-scroll system. Rows correspond to different orders and columns represent the three state variables V1, V2 and I. Yellow and orange curves denote the predicted and reference trajectories, respectively. without providing a clear improvement over the third-order model, we adopt the third-order NG-RC model in the main-text comparison as a mor… view at source ↗

**Figure 3.** Figure 3: Forecasting performance of random MLP features with different feature dimensions on the Kuramoto-Sivashinsky equation. The top row shows the reference spatiotemporal field for the Kuramoto-Sivashinsky equation with domain size L = 22. The remaining rows show forecasts generated using random MLP features with different feature dimensions dh, together with the corresponding pointwise prediction errors. Dimen… view at source ↗

**Figure 4.** Figure 4: Forecasting performance of KARC with different bases on the double-scroll system. Rows correspond to different basis functions, including Fourier, B-spline and Chebyshev bases, and columns represent the three state variables V1, V2 and I. Blue and orange curves denote the predicted and reference trajectories, respectively. indicate that the choice of basis function plays an important role in KARC, and that… view at source ↗

**Figure 4.** Figure 4: Forecasting performance of RC with different reservoir dimensions on the double-scroll system. Each row corresponds to a different reservoir dimension dh, and each column represents one state variable of the double-scroll system. The trajectory colors follow the same convention as in the main-text double-scroll experiment. We first examine the sensitivity of RC to reservoir dimension on the double-scroll s… view at source ↗

**Figure 5.** Figure 5: Forecasting performance of RC with different reservoir sizes on the Kuramoto-Sivashinsky equation. The top-left panel shows the reference spatiotemporal field, and the subsequent rows show RC forecasts obtained with different reservoir dimensions dh = 1000, 3000, 8000 and 12000. The right column reports the corresponding pointwise prediction errors. We next investigate whether increasing the reservoir dim… view at source ↗

**Figure 5.** Figure 5: Forecasting performance of NG-RC with different orders on the double-scroll system. Rows correspond to different orders and columns represent the three state variables V1, V2 and I. The trajectory colors follow the same convention as in the main-text double-scroll experiment. 2 0 2 Fourier V1 1 0 1 V2 2 0 2 I 2 0 2 B-spline 1 0 1 2 0 2 0 5 10 15 20 25 max t 2 0 2 Chebyshev 0 5 10 15 20 25 max t 1 0 1 0 5 1… view at source ↗

**Figure 6.** Figure 6: Forecasting performance of VolterraRC with different kernel parameters λ on the KuramotoSivashinsky equation. The top row shows the reference spatiotemporal field, and the subsequent rows show the forecasts and corresponding pointwise errors for different λ. in the figure, however, increasing λ does not monotonically improve the forecasting performance. For relatively small or moderate values of λ, Volter… view at source ↗

**Figure 6.** Figure 6: Forecasting performance of KARC with different bases on the double-scroll system. Rows correspond to different basis functions, including Fourier, B-spline and Chebyshev bases, and columns represent the three state variables V1, V2 and I. The trajectory colors follow the same convention as in the main-text double-scroll experiment. to the oscillatory nature of the double-scroll dynamics [PITH_FULL_IMAGE:f… view at source ↗

**Figure 7.** Figure 7: Forecasting performance of KARC with different bases on the Kuramoto-Sivashinsky equation. The top-left panel shows the reference spatiotemporal field, and the subsequent rows show KARC forecasts obtained using Fourier, B-spline and Chebyshev bases. The right column presents the corresponding pointwise prediction errors. The results compare how the choice of basis function affects the spatiotemporal foreca… view at source ↗

**Figure 7.** Figure 7: Forecasting performance of RC with different reservoir dimensions on the Kuramoto-Sivashinsky equation. The top-left panel shows the reference spatiotemporal field, and the subsequent rows show RC forecasts obtained with different reservoir dimensions dh = 1000, 3000, 8000 and 12000. The right column reports the corresponding pointwise prediction errors. We next extend the sensitivity analysis to the Kuram… view at source ↗

**Figure 8.** Figure 8: Forecasting performance of RC with different reservoir sizes on the shallow water equations. The first row shows the reference solution fields from step 10 to step 100, and the remaining rows show the corresponding prediction errors of RC with reservoir dimensions dh = 3000, dh = 5000, and dh = 8000. Truth step=10 step=20 step=30 step=40 step=50 step=60 step=70 step=80 step=90 step=100 = 0.1 0 = 0.5 0 = 0.… view at source ↗

**Figure 8.** Figure 8: Forecasting performance of VolterraRC with different kernel parameters α on the KuramotoSivashinsky equation. The top row shows the reference spatiotemporal field, and the subsequent rows show the forecasts and corresponding pointwise errors for different α. Finally, [PITH_FULL_IMAGE:figures/full_fig_p035_8.png] view at source ↗

**Figure 9.** Figure 9: Forecasting performance of VolterraRC with different kernel parameters λ on the shallow water equations. The first row shows the reference solution fields from step 10 to step 100. The remaining rows show the corresponding pointwise prediction errors of VolterraRC with different values of the kernel memory parameter λ. We also evaluate RC with different reservoir dimensions on the shallow water equations. … view at source ↗

**Figure 9.** Figure 9: Forecasting performance of KARC with different bases on the Kuramoto-Sivashinsky equation. The top-left panel shows the reference spatiotemporal field, and the subsequent rows show KARC forecasts obtained using Fourier, B-spline and Chebyshev bases. The right column presents the corresponding pointwise prediction errors. The results compare how the choice of basis function affects the spatiotemporal foreca… view at source ↗

**Figure 10.** Figure 10: Forecasting performance of KARC with different bases on the shallow water equation. The first row shows the reference solution fields from step 10 to step 100, and the remaining rows show the corresponding prediction errors of KARC using Fourier, B-spline, and Chebyshev basis functions. equations. Different from the results on the Kuramoto-Sivashinsky equation, increasing λ improves the forecasting perfor… view at source ↗

**Figure 10.** Figure 10: Forecasting performance of RC with different reservoir dimensions on the shallow water equations. The first row shows the reference solution fields from step 10 to step 100, and the remaining rows show the corresponding prediction errors of RC with reservoir dimensions dh = 3000, dh = 5000, and dh = 8000. Truth step=10 step=20 step=30 step=40 step=50 step=60 step=70 step=80 step=90 step=100 = 0.1 0 = 0.5 … view at source ↗

**Figure 11.** Figure 11: Additional text-to-image generation examples. Each row corresponds to a text prompt. Each column shows the results produced by FLUX.1-dev (baseline), Spectrum, KARC with Fourier bases, and KARC with B-spline bases, respectively. We further provide additional text-to-image generation examples using FLUX.1-dev [6] to evaluate the visual quality of KARC-based acceleration. We compare the original spectrum [7… view at source ↗

**Figure 11.** Figure 11: Forecasting performance of VolterraRC with different kernel parameters α on the shallow water equations. The first row shows the reference solution fields from step 10 to step 100. The remaining rows show the corresponding pointwise prediction errors of VolterraRC with different values of the kernel memory parameter α. We next turn to the shallow water equations, a benchmark that embodies additional physi… view at source ↗

**Figure 12.** Figure 12: Forecasting performance of RC, NG-RC, and KARC on the lorenz63 system. Rows correspond to different models, and columns correspond to the three state variables of the lorenz63 system. Λmax denotes the largest Lyapunov exponent, and one unit on the horizontal axis represents one Lyapunov time. In the main text, the double-scroll system was used as a representative low-dimensional chaotic ODE benchmark to a… view at source ↗

**Figure 12.** Figure 12: Forecasting performance of KARC with different bases on the shallow water equation. The first row shows the reference solution fields from step 10 to step 100, and the remaining rows show the corresponding prediction errors of KARC using Fourier, B-spline, and Chebyshev basis functions. and 0.90, the accumulated error is noticeably reduced, and the model better preserves the spatiotemporal evolution of th… view at source ↗

**Figure 13.** Figure 13: (a) Schematic illustration of the hybrid KARC+FNO-2D architecture. (b) Qualitative comparison between KARC-only and KARC+FNO-2D predictions at selected forecasting times. The rows show the 64 × 64 ground truth, the KARC-only prediction computed at 32 × 32 resolution and upsampled to 64 × 64, and the final 64 × 64 KARC+FNO-2D prediction, respectively. (c) Forecasting errors on the Navier-Stokes equation.Th… view at source ↗

**Figure 14.** Figure 14: Forecasting performance of RC, NG-RC, and KARC on the Lorenz63 system. Rows correspond to different models, and columns correspond to the three state variables of the Lorenz63 system. Λmax denotes the largest Lyapunov exponent, and one unit on the horizontal axis represents one Lyapunov time. In the main text, the double-scroll system was used as a representative low-dimensional chaotic ODE benchmark to a… view at source ↗

**Figure 15.** Figure 15: (a) Schematic illustration of the hybrid KARC+FNO-2D architecture. (b) Qualitative comparison between KARC-only and KARC+FNO-2D predictions at selected forecasting times. The rows show the 64 × 64 ground truth, the KARC-only prediction computed at 32 × 32 resolution and upsampled to 64 × 64, and the final 64 × 64 KARC+FNO-2D prediction, respectively. (c) Forecasting errors on the Navier-Stokes equation.Th… view at source ↗

read the original abstract

Reservoir computing offers a lightweight framework for forecasting dynamical systems but may struggle to capture long-range dependencies due to limited representational capacity. Conventional reservoir computing recurrently uses trainable reservoirs with hyperparameter sensitivity, while the next-generation reservoir computing removes recurrence at the cost of rapidly growing feature dimensions. Here, we develop Kolmogorov-Arnold Reservoir Computing (KARC), which replaces reservoirs with explicit basis-function expansions inspired by the Kolmogorov-Arnold representation theorem. We rigorously show that KARC is a lightweight design of Kolmogorov-Arnold networks (KANs), preserving the potential expressive capacity of KANs while admitting efficient closed-form training of reservoir computing. At comparable cost, KARC outperforms existing reservoir computing methods on challenging benchmarks including partial differential equations. It can also be integrated with generative diffusion models for text-to-image generation. This work thus establishes a principled bridge between reservoir computing and KANs, enabling efficient and high-fidelity dynamical system forecasting.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

KARC swaps standard reservoirs for explicit Kolmogorov-Arnold basis expansions to keep closed-form training while claiming KAN-level capacity, but the abstract leaves the actual derivations and benchmark controls uncheckable.

read the letter

The main thing here is a concrete construction that replaces the usual reservoir layer with basis-function expansions drawn from the Kolmogorov-Arnold theorem. This lets the method train in closed form like next-generation reservoir computing while the authors argue it retains the representational power of full KANs.

The construction itself looks like the genuinely new piece. Prior work has explored KANs for dynamical modeling and reservoir methods for forecasting, but the specific lightweight hybrid that keeps the training procedure simple is not already in the cited literature.

The paper does a reasonable job stating the practical motivation: standard reservoirs can miss long-range structure, and next-gen versions blow up in feature size. Framing KARC as a bridge that avoids both problems is clear enough.

The soft spots sit in the strength of the supporting claims. The abstract says the authors “rigorously show” preservation of expressive capacity and benchmark superiority on PDE tasks, yet supplies no derivation outline, no error bars, and no dataset or hyperparameter details. Without those, it is impossible to judge whether the basis choice actually delivers the claimed capacity or whether the reported gains depend on favorable post-hoc selections. The diffusion-model integration is mentioned only in passing and adds little.

This is aimed at researchers already working on scientific machine learning for dynamical systems who need something lighter than full KAN training. A reader who wants to test the idea on their own forecasting task could get value from the construction even if the superiority claims need checking.

I would send it to peer review so the derivations and experimental controls can be examined directly.

Referee Report

1 major / 0 minor

Summary. The manuscript introduces Kolmogorov-Arnold Reservoir Computing (KARC), which substitutes explicit basis-function expansions inspired by the Kolmogorov-Arnold representation theorem for conventional reservoirs in reservoir computing. It claims KARC is a lightweight design of Kolmogorov-Arnold networks (KANs) that preserves their expressive capacity while enabling efficient closed-form training, outperforms existing RC methods on dynamical-system benchmarks including PDEs at comparable cost, and integrates with generative diffusion models.

Significance. If the central claims of preserved expressive capacity and rigorous closed-form training hold, the work would establish a useful bridge between reservoir computing and KANs, offering a principled route to efficient, high-fidelity forecasting of dynamical systems with potential impact in scientific machine learning.

major comments (1)

Abstract: the claim of a 'rigorous' demonstration that KARC preserves the expressive capacity of KANs is load-bearing for the central contribution, yet the provided material supplies no derivation, theorem statement, or proof outline; without explicit verification of this step the superiority and preservation assertions cannot be assessed.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their review and for highlighting this critical point about the central claim. We address the comment below and commit to a revision that supplies the missing supporting material.

read point-by-point responses

Referee: Abstract: the claim of a 'rigorous' demonstration that KARC preserves the expressive capacity of KANs is load-bearing for the central contribution, yet the provided material supplies no derivation, theorem statement, or proof outline; without explicit verification of this step the superiority and preservation assertions cannot be assessed.

Authors: We agree the abstract's assertion requires explicit verification. The current manuscript states the claim at a high level in Section 3 but does not include a formal theorem or proof outline. In the revised manuscript we will insert a new subsection (Section 3.2) containing: (i) the theorem statement that KARC constitutes a lightweight KAN design preserving expressive capacity via finite Kolmogorov-Arnold basis expansions in the reservoir, and (ii) a concise proof sketch relying on the universal approximation property of KANs together with the closed-form linear readout. This addition will allow direct assessment of the preservation argument and the closed-form training claim. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper's central claim is that KARC replaces conventional reservoirs with explicit basis-function expansions drawn from the Kolmogorov-Arnold theorem, yielding a lightweight KAN variant that supports closed-form training while preserving expressive capacity. This construction is presented as an independent design choice whose equivalence to KANs is asserted via a rigorous demonstration rather than by redefining the target quantity in terms of itself or by fitting parameters to the very quantities being predicted. No self-citation chain is invoked as the sole justification for a uniqueness theorem, no fitted input is relabeled as a prediction, and no ansatz is smuggled through prior self-work. The derivation therefore remains self-contained against external benchmarks and does not reduce to its inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The design rests on the Kolmogorov-Arnold representation theorem as a standard mathematical fact and introduces the KARC architecture itself; no free parameters or invented entities are mentioned in the abstract.

axioms (1)

standard math The Kolmogorov-Arnold representation theorem applies to the explicit basis-function expansions used in place of reservoirs.
Invoked to justify the substitution that preserves expressive capacity.

pith-pipeline@v0.9.1-grok · 5680 in / 1059 out tokens · 32384 ms · 2026-06-30T10:49:46.785000+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Reference graph

Works this paper leans on

54 extracted references · 2 canonical work pages · 2 internal anchors

[1]

Lim and S

B. Lim and S. Zohren, Time-series forecasting with deep learning: a survey, Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 379, 20200209 (2021)

2021
[2]

Kovachki, Z

N. Kovachki, Z. Li, B. Liu, K. Azizzadenesheli, K. Bhattacharya, A. Stuart, and A. Anandkumar, Neural operator: Learning maps between function spaces with applications to PDEs, J. Mach. Learn. Res. 24, 1 (2023)

2023
[3]

Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling

J. Chung, C. Gulcehre, K. Cho, and Y. Bengio, Empirical evaluation of gated recurrent neural networks on sequence modeling, arXiv preprint arXiv:1412.3555 (2014)

work page internal anchor Pith review Pith/arXiv arXiv 2014
[4]

Vaswani, N

A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, Attention is all you need, in Advances in Neural Information Processing Systems , Vol. 30 (2017) pp. 5998–6008

2017
[5]

Z. Li, N. B. Kovachki, K. Azizzadenesheli, B. liu, K. Bhattacharya, A. Stuart, and A. Anandkumar, Fourier neural operator for parametric partial differential equations, in Int. Conf. Learn. Represent. (2021)

2021
[6]

Wang and C

T. Wang and C. Wang, Latent neural operator for solving forward and inverse PDE problems, in Adv. Neural Inf. Process. Syst., Vol. 37 (2024)

2024
[7]

L. Lu, P. Jin, G. Pang, Z. Zhang, and G. E. Karniadakis, Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators, Nat. Mach. Intell. 3, 218 (2021)

2021
[8]

Goswami, A

S. Goswami, A. Bora, Y. Yu, and G. E. Karniadakis, Physics-informed deep neural operator networks, in Machine Learning in Modeling and Simulation: Methods and Applications , Computational Methods in Engineering & the Sciences, edited by T. Rabczuk and K.-J. Bathe (Springer, 2023) pp. 219–254

2023
[9]

N. B. Kovachki, S. Lanthaler, and A. M. Stuart, Operator learning: Algorithms and analysis, in Numerical Analysis Meets Machine Learning , Handb. Numer. Anal., Vol. 25 (Elsevier, 2024) pp. 419–467

2024
[10]

S. Liu, Y. Yu, T. Zhang, H. Liu, X. Liu, and D. Meng, Architectures, variants, and performance of neural operators: A comparative review, Neurocomputing 648, 130518 (2025)

2025
[11]

Benidis, S

K. Benidis, S. S. Rangapuram, V. Flunkert, Y. Wang, D. Maddix, C. Turkmen, J. Gasthaus, M. Bohlke-Schneider, D. Salinas, L. Stella, et al. , Deep learning for time series forecasting: Tutorial and literature survey, ACM Comput. Surv. 55, 1 (2022)

2022
[13]

Pathak, B

J. Pathak, B. Hunt, M. Girvan, Z. Lu, and E. Ott, Model-free prediction of large spatiotemporally chaotic systems from data: A reservoir computing approach, Phys. Rev. Lett. 120, 024102 (2018)

2018
[15]

N. Lin, S. Wang, Y. Li, B. Wang, S. Shi, Y. He, W. Zhang, Y. Yu, Y. Zhang, X. Zhang, et al. , Resistive memory-based zero-shot liquid state machine for multimodal event data learning, Nat. Comput. Sci. 5, 37 (2025)

2025
[16]

R. S. Zimmermann and U. Parlitz, Observing spatio-temporal dynamics of excitable media using reservoir computing, Chaos 28, 043118 (2018)

2018
[17]

Xiong and H

Y. Xiong and H. Zhao, Chaotic time series prediction based on long short-term memory neural networks, Sci. China Phy. Mech. Astron. 49, 120501 (2019)

2019
[18]

H. Fan, J. Jiang, C. Zhang, X. Wang, and Y.-C. Lai, Long-term prediction of chaotic systems with machine learning, Phys. Rev. Res. 2, 012080 (2020)

2020
[19]

Rafayelyan, J

M. Rafayelyan, J. Dong, Y. Tan, F. Krzakala, and S. Gigan, Large-scale optical reservoir computing for spatiotemporal chaotic systems prediction, Phys. Rev. X 10, 041037 (2020)

2020
[20]

Z. Lin, Z. Lu, Z. Di, and Y. Tang, Learning noise-induced transitions by multi-scaling reservoir computing, Nat. Commun. 15, 6584 (2024)

2024
[21]

H. Tan, L. Shi, S. Wang, and S.-X. Qu, Improving model-free prediction of chaotic dynamics by purifying the incomplete input, AIP Adv. 14 (2024)

2024
[22]

X. Li, Q. Zhu, C. Zhao, X. Duan, B. Zhao, X. Zhang, H. Ma, J. Sun, and W. Lin, Higher-order granger reservoir D Error Bound of KARC for Fading Memory Dynamical Systems 22 computing: simultaneously achieving scalable complex structures inference and accurate dynamics prediction, Nat. Commun. 15, 2506 (2024)

2024
[23]

X. Han, Z. Qi, V. Kundrat, H. Li, Z. Li, X. Guo, P. Mao, W. Zheng, S. Hou, R. Liu, et al. , Very-large-scale mimetic optogenetic synapses for physical reservoir computing, Nat. Commun. 17, 1514 (2026)

2026
[24]

Amann, K

A. Amann, K. Lüdge, U. Parlitz, and M. Small, Nonlinear dynamics of reservoir computing: Theory, realization, and application, Chaos 36 (2026)

2026
[25]

Grigoryeva and J.-P

L. Grigoryeva and J.-P. Ortega, Universal discrete-time reservoir computers with stochastic inputs and linear readouts using non-homogeneous state-aﬀine systems, J. Mach. Learn. Res. 19, 1 (2018)

2018
[26]

Grigoryeva and J.-P

L. Grigoryeva and J.-P. Ortega, Echo state networks are universal, Neural Netw. 108, 495 (2018)

2018
[27]

Gonon and J.-P

L. Gonon and J.-P. Ortega, Reservoir computing universality with stochastic inputs, IEEE Trans. Neural Netw. Learn. Syst. 31, 100 (2019)

2019
[28]

A. Hart, J. Hook, and J. Dawes, Embedding and approximation theorems for echo state networks, Neural Netw. 128, 234 (2020)

2020
[29]

Gonon and J.-P

L. Gonon and J.-P. Ortega, Fading memory echo state networks are universal, Neural Netw. 138, 10 (2021)

2021
[30]

L. A. Thiede and U. Parlitz, Gradient based hyperparameter optimization in echo state networks, Neural Netw. 115, 23 (2019)

2019
[31]

Ren and H

B. Ren and H. Ma, Global optimization of hyper-parameters in reservoir computing, Electron. Res. Arch. 30, 2719 (2022)

2022
[32]

M. Yan, C. Huang, P. Bienstman, P. Tino, W. Lin, and J. Sun, Emerging opportunities and challenges for the future of reservoir computing, Nat. Commun. 15, 2056 (2024)

2056
[33]

Lukoševičius and H

M. Lukoševičius and H. Jaeger, Reservoir computing approaches to recurrent neural network training, Comput. Sci. Rev. 3, 127 (2009)

2009
[34]

Martin and C

E. Martin and C. Cundy, Parallelizing linear recurrent neural nets over sequence length, in Int. Conf. Learn. Represent. (2018)

2018
[35]

D. J. Gauthier, E. Bollt, A. Griﬀith, and W. A. S. Barbosa, Next generation reservoir computing, Nat. Commun. 12, 5564 (2021)

2021
[37]

Grigoryeva, H

L. Grigoryeva, H. L. J. Ting, and J.-P. Ortega, Infinite-dimensional next-generation reservoir computing, Phys. Rev. E 111, 035305 (2025)

2025
[38]

Cestnik and E

R. Cestnik and E. A. Martens, Next-generation reservoir computing for dynamical inference, Chaos 36, 013115 (2026)

2026
[39]

A. N. Kolmogorov, On the representation of continuous functions of many variables by superposition of continuous functions of one variable and addition, Dokl. Akad. Nauk SSSR 114, 953 (1957)

1957
[40]

Z. Liu, Y. Wang, S. Vaidya, F. Ruehle, J. Halverson, M. Soljačić, T. Y. Hou, and M. Tegmark, KAN: Kolmogorov-arnold networks, in Proceedings of the International Conference on Learning Representations (2025)

2025
[41]

Z. Liu, M. Tegmark, P. Ma, W. Matusik, and Y. Wang, Kolmogorov-arnold networks meet science, Phys. Rev. X 15, 041051 (2025)

2025
[42]

MesaNet: Sequence Modeling by Locally Optimal Test-Time Training

J. von Oswald, N. Scherrer, S. Kobayashi, L. Versari, S. Yang, M. Schlegel, K. Maile, Y. Schimpf, O. Sieberling, A. Meulemans, et al. , Mesanet: Sequence modeling by locally optimal test-time training, arXiv:2506.05233 (2025)

work page internal anchor Pith review Pith/arXiv arXiv 2025
[43]

J. Han, J. Shi, P. Li, H. Ye, Q. Guo, and S. Ermon, Adaptive spectral feature forecasting for diffusion sampling acceleration, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2026) pp. 43320–43330

2026
[44]

Gonon, L

L. Gonon, L. Grigoryeva, and J.-P. Ortega, Reservoir kernels and volterra series, IEEE Trans. Neural Netw. Learn. Syst. , 1 (2025)

2025
[45]

G. K. Vallis, Atmospheric and Oceanic Fluid Dynamics: Fundamentals and Large-Scale Circulation , 2nd ed. (Cambridge University Press, 2017)

2017
[47]

G. Li, L. Huang, and Y. Lei, Reservoir computing meeting kolmogorov-arnold networks: prediction of high-dimensional chaotic systems, Chaos 35, 103120 (2025)

2025
[48]

Z. Lin, J. Kurths, and Y. Tang, Multi-scaling reservoir computing learns noise-induced transitions with Lévy noise, Chaos 35, 073132 (2025)

2025
[49]

Raissi, P

M. Raissi, P. Perdikaris, and G. E. Karniadakis, Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations, J. Comput. Phys. 378, 686 (2019)

2019
[50]

Z. Li, H. Zheng, N. Kovachki, D. Jin, H. Chen, B. Liu, K. Azizzadenesheli, and A. Anandkumar, Physics-informed neural operator for learning partial differential equations, ACM/IMS J. Data Sci. 1, 1 (2024) . Supplementary Information: Kolmogorov-Arnold Reservoir Computing Juntian Huang, 1 Jürgen Kurths, 2, 3, 4 and Ying Tang 1, 5, 6, 7, ∗ 1Institute of Fun...

2024
[51]

D. J. Gauthier, E. Bollt, A. Griﬃth, and W. A. S. Barbosa, Next generation reservoir computing, Nat. Commun. 12, 5564 (2021)

2021
[52]

Grigoryeva, H

L. Grigoryeva, H. L. J. Ting, and J.-P. Ortega, Inﬁnite-dimensional next-generation reservoir computing, Phys. Rev. E 111, 035305 (2025)

2025
[53]

echo state

H. Jaeger, The “echo state” approach to analysing and training recurrent neural networks–with an erratum note , GMD Technical Report 148 (German National Research Center for Information Technology, Bonn, Germany, 2001)

2001
[54]

Maass, T

W. Maass, T. Natschläger, and H. Markram, Real-time computing without stable states: A new framework for neural computation based on perturbations, Neural Comput. 14, 2531 (2002)

2002
[55]

E. Bollt, On explaining the surprising success of reservoir computing forecaster of chaos? the universal machine learning dynamical system with contrast to var and dmd, Chaos 31, 013108 (2021)

2021
[56]

Black Forest Labs, FLUX, https://github.com/black-forest-labs/flux (2024), gitHub repository

2024
[57]

J. Han, J. Shi, P. Li, H. Ye, Q. Guo, and S. Ermon, Adaptive spectral feature forecasting for diﬀusion sampling accelera- tion, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2026) pp. 43320–43330

2026
[58]

Z. Li, N. B. Kovachki, K. Azizzadenesheli, B. liu, K. Bhattacharya, A. Stuart, and A. Anandkumar, Fourier neural operator for parametric partial diﬀerential equations, in Int. Conf. Learn. Represent. (2021)

2021

[1] [1]

Lim and S

B. Lim and S. Zohren, Time-series forecasting with deep learning: a survey, Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 379, 20200209 (2021)

2021

[2] [2]

Kovachki, Z

N. Kovachki, Z. Li, B. Liu, K. Azizzadenesheli, K. Bhattacharya, A. Stuart, and A. Anandkumar, Neural operator: Learning maps between function spaces with applications to PDEs, J. Mach. Learn. Res. 24, 1 (2023)

2023

[3] [3]

Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling

J. Chung, C. Gulcehre, K. Cho, and Y. Bengio, Empirical evaluation of gated recurrent neural networks on sequence modeling, arXiv preprint arXiv:1412.3555 (2014)

work page internal anchor Pith review Pith/arXiv arXiv 2014

[4] [4]

Vaswani, N

A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, Attention is all you need, in Advances in Neural Information Processing Systems , Vol. 30 (2017) pp. 5998–6008

2017

[5] [5]

Z. Li, N. B. Kovachki, K. Azizzadenesheli, B. liu, K. Bhattacharya, A. Stuart, and A. Anandkumar, Fourier neural operator for parametric partial differential equations, in Int. Conf. Learn. Represent. (2021)

2021

[6] [6]

Wang and C

T. Wang and C. Wang, Latent neural operator for solving forward and inverse PDE problems, in Adv. Neural Inf. Process. Syst., Vol. 37 (2024)

2024

[7] [7]

L. Lu, P. Jin, G. Pang, Z. Zhang, and G. E. Karniadakis, Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators, Nat. Mach. Intell. 3, 218 (2021)

2021

[8] [8]

Goswami, A

S. Goswami, A. Bora, Y. Yu, and G. E. Karniadakis, Physics-informed deep neural operator networks, in Machine Learning in Modeling and Simulation: Methods and Applications , Computational Methods in Engineering & the Sciences, edited by T. Rabczuk and K.-J. Bathe (Springer, 2023) pp. 219–254

2023

[9] [9]

N. B. Kovachki, S. Lanthaler, and A. M. Stuart, Operator learning: Algorithms and analysis, in Numerical Analysis Meets Machine Learning , Handb. Numer. Anal., Vol. 25 (Elsevier, 2024) pp. 419–467

2024

[10] [10]

S. Liu, Y. Yu, T. Zhang, H. Liu, X. Liu, and D. Meng, Architectures, variants, and performance of neural operators: A comparative review, Neurocomputing 648, 130518 (2025)

2025

[11] [11]

Benidis, S

K. Benidis, S. S. Rangapuram, V. Flunkert, Y. Wang, D. Maddix, C. Turkmen, J. Gasthaus, M. Bohlke-Schneider, D. Salinas, L. Stella, et al. , Deep learning for time series forecasting: Tutorial and literature survey, ACM Comput. Surv. 55, 1 (2022)

2022

[12] [13]

Pathak, B

J. Pathak, B. Hunt, M. Girvan, Z. Lu, and E. Ott, Model-free prediction of large spatiotemporally chaotic systems from data: A reservoir computing approach, Phys. Rev. Lett. 120, 024102 (2018)

2018

[13] [15]

N. Lin, S. Wang, Y. Li, B. Wang, S. Shi, Y. He, W. Zhang, Y. Yu, Y. Zhang, X. Zhang, et al. , Resistive memory-based zero-shot liquid state machine for multimodal event data learning, Nat. Comput. Sci. 5, 37 (2025)

2025

[14] [16]

R. S. Zimmermann and U. Parlitz, Observing spatio-temporal dynamics of excitable media using reservoir computing, Chaos 28, 043118 (2018)

2018

[15] [17]

Xiong and H

Y. Xiong and H. Zhao, Chaotic time series prediction based on long short-term memory neural networks, Sci. China Phy. Mech. Astron. 49, 120501 (2019)

2019

[16] [18]

H. Fan, J. Jiang, C. Zhang, X. Wang, and Y.-C. Lai, Long-term prediction of chaotic systems with machine learning, Phys. Rev. Res. 2, 012080 (2020)

2020

[17] [19]

Rafayelyan, J

M. Rafayelyan, J. Dong, Y. Tan, F. Krzakala, and S. Gigan, Large-scale optical reservoir computing for spatiotemporal chaotic systems prediction, Phys. Rev. X 10, 041037 (2020)

2020

[18] [20]

Z. Lin, Z. Lu, Z. Di, and Y. Tang, Learning noise-induced transitions by multi-scaling reservoir computing, Nat. Commun. 15, 6584 (2024)

2024

[19] [21]

H. Tan, L. Shi, S. Wang, and S.-X. Qu, Improving model-free prediction of chaotic dynamics by purifying the incomplete input, AIP Adv. 14 (2024)

2024

[20] [22]

X. Li, Q. Zhu, C. Zhao, X. Duan, B. Zhao, X. Zhang, H. Ma, J. Sun, and W. Lin, Higher-order granger reservoir D Error Bound of KARC for Fading Memory Dynamical Systems 22 computing: simultaneously achieving scalable complex structures inference and accurate dynamics prediction, Nat. Commun. 15, 2506 (2024)

2024

[21] [23]

X. Han, Z. Qi, V. Kundrat, H. Li, Z. Li, X. Guo, P. Mao, W. Zheng, S. Hou, R. Liu, et al. , Very-large-scale mimetic optogenetic synapses for physical reservoir computing, Nat. Commun. 17, 1514 (2026)

2026

[22] [24]

Amann, K

A. Amann, K. Lüdge, U. Parlitz, and M. Small, Nonlinear dynamics of reservoir computing: Theory, realization, and application, Chaos 36 (2026)

2026

[23] [25]

Grigoryeva and J.-P

L. Grigoryeva and J.-P. Ortega, Universal discrete-time reservoir computers with stochastic inputs and linear readouts using non-homogeneous state-aﬀine systems, J. Mach. Learn. Res. 19, 1 (2018)

2018

[24] [26]

Grigoryeva and J.-P

L. Grigoryeva and J.-P. Ortega, Echo state networks are universal, Neural Netw. 108, 495 (2018)

2018

[25] [27]

Gonon and J.-P

L. Gonon and J.-P. Ortega, Reservoir computing universality with stochastic inputs, IEEE Trans. Neural Netw. Learn. Syst. 31, 100 (2019)

2019

[26] [28]

A. Hart, J. Hook, and J. Dawes, Embedding and approximation theorems for echo state networks, Neural Netw. 128, 234 (2020)

2020

[27] [29]

Gonon and J.-P

L. Gonon and J.-P. Ortega, Fading memory echo state networks are universal, Neural Netw. 138, 10 (2021)

2021

[28] [30]

L. A. Thiede and U. Parlitz, Gradient based hyperparameter optimization in echo state networks, Neural Netw. 115, 23 (2019)

2019

[29] [31]

Ren and H

B. Ren and H. Ma, Global optimization of hyper-parameters in reservoir computing, Electron. Res. Arch. 30, 2719 (2022)

2022

[30] [32]

M. Yan, C. Huang, P. Bienstman, P. Tino, W. Lin, and J. Sun, Emerging opportunities and challenges for the future of reservoir computing, Nat. Commun. 15, 2056 (2024)

2056

[31] [33]

Lukoševičius and H

M. Lukoševičius and H. Jaeger, Reservoir computing approaches to recurrent neural network training, Comput. Sci. Rev. 3, 127 (2009)

2009

[32] [34]

Martin and C

E. Martin and C. Cundy, Parallelizing linear recurrent neural nets over sequence length, in Int. Conf. Learn. Represent. (2018)

2018

[33] [35]

D. J. Gauthier, E. Bollt, A. Griﬀith, and W. A. S. Barbosa, Next generation reservoir computing, Nat. Commun. 12, 5564 (2021)

2021

[34] [37]

Grigoryeva, H

L. Grigoryeva, H. L. J. Ting, and J.-P. Ortega, Infinite-dimensional next-generation reservoir computing, Phys. Rev. E 111, 035305 (2025)

2025

[35] [38]

Cestnik and E

R. Cestnik and E. A. Martens, Next-generation reservoir computing for dynamical inference, Chaos 36, 013115 (2026)

2026

[36] [39]

A. N. Kolmogorov, On the representation of continuous functions of many variables by superposition of continuous functions of one variable and addition, Dokl. Akad. Nauk SSSR 114, 953 (1957)

1957

[37] [40]

Z. Liu, Y. Wang, S. Vaidya, F. Ruehle, J. Halverson, M. Soljačić, T. Y. Hou, and M. Tegmark, KAN: Kolmogorov-arnold networks, in Proceedings of the International Conference on Learning Representations (2025)

2025

[38] [41]

Z. Liu, M. Tegmark, P. Ma, W. Matusik, and Y. Wang, Kolmogorov-arnold networks meet science, Phys. Rev. X 15, 041051 (2025)

2025

[39] [42]

MesaNet: Sequence Modeling by Locally Optimal Test-Time Training

J. von Oswald, N. Scherrer, S. Kobayashi, L. Versari, S. Yang, M. Schlegel, K. Maile, Y. Schimpf, O. Sieberling, A. Meulemans, et al. , Mesanet: Sequence modeling by locally optimal test-time training, arXiv:2506.05233 (2025)

work page internal anchor Pith review Pith/arXiv arXiv 2025

[40] [43]

J. Han, J. Shi, P. Li, H. Ye, Q. Guo, and S. Ermon, Adaptive spectral feature forecasting for diffusion sampling acceleration, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2026) pp. 43320–43330

2026

[41] [44]

Gonon, L

L. Gonon, L. Grigoryeva, and J.-P. Ortega, Reservoir kernels and volterra series, IEEE Trans. Neural Netw. Learn. Syst. , 1 (2025)

2025

[42] [45]

G. K. Vallis, Atmospheric and Oceanic Fluid Dynamics: Fundamentals and Large-Scale Circulation , 2nd ed. (Cambridge University Press, 2017)

2017

[43] [47]

G. Li, L. Huang, and Y. Lei, Reservoir computing meeting kolmogorov-arnold networks: prediction of high-dimensional chaotic systems, Chaos 35, 103120 (2025)

2025

[44] [48]

Z. Lin, J. Kurths, and Y. Tang, Multi-scaling reservoir computing learns noise-induced transitions with Lévy noise, Chaos 35, 073132 (2025)

2025

[45] [49]

Raissi, P

M. Raissi, P. Perdikaris, and G. E. Karniadakis, Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations, J. Comput. Phys. 378, 686 (2019)

2019

[46] [50]

Z. Li, H. Zheng, N. Kovachki, D. Jin, H. Chen, B. Liu, K. Azizzadenesheli, and A. Anandkumar, Physics-informed neural operator for learning partial differential equations, ACM/IMS J. Data Sci. 1, 1 (2024) . Supplementary Information: Kolmogorov-Arnold Reservoir Computing Juntian Huang, 1 Jürgen Kurths, 2, 3, 4 and Ying Tang 1, 5, 6, 7, ∗ 1Institute of Fun...

2024

[47] [51]

D. J. Gauthier, E. Bollt, A. Griﬃth, and W. A. S. Barbosa, Next generation reservoir computing, Nat. Commun. 12, 5564 (2021)

2021

[48] [52]

Grigoryeva, H

L. Grigoryeva, H. L. J. Ting, and J.-P. Ortega, Inﬁnite-dimensional next-generation reservoir computing, Phys. Rev. E 111, 035305 (2025)

2025

[49] [53]

echo state

H. Jaeger, The “echo state” approach to analysing and training recurrent neural networks–with an erratum note , GMD Technical Report 148 (German National Research Center for Information Technology, Bonn, Germany, 2001)

2001

[50] [54]

Maass, T

W. Maass, T. Natschläger, and H. Markram, Real-time computing without stable states: A new framework for neural computation based on perturbations, Neural Comput. 14, 2531 (2002)

2002

[51] [55]

E. Bollt, On explaining the surprising success of reservoir computing forecaster of chaos? the universal machine learning dynamical system with contrast to var and dmd, Chaos 31, 013108 (2021)

2021

[52] [56]

Black Forest Labs, FLUX, https://github.com/black-forest-labs/flux (2024), gitHub repository

2024

[53] [57]

J. Han, J. Shi, P. Li, H. Ye, Q. Guo, and S. Ermon, Adaptive spectral feature forecasting for diﬀusion sampling accelera- tion, in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2026) pp. 43320–43330

2026

[54] [58]

Z. Li, N. B. Kovachki, K. Azizzadenesheli, B. liu, K. Bhattacharya, A. Stuart, and A. Anandkumar, Fourier neural operator for parametric partial diﬀerential equations, in Int. Conf. Learn. Represent. (2021)

2021