Deep Learning Surrogates for Emulating Stochastic Climate Tipping Dynamics

Adeline Hillier; Anand Gnanadesikan; Caroline Tang; Jay Brett; Jenelle Millison; Jennifer Sleeman

arxiv: 2605.20580 · v1 · pith:GGSP62BRnew · submitted 2026-05-20 · 💻 cs.LG

Deep Learning Surrogates for Emulating Stochastic Climate Tipping Dynamics

Adeline Hillier , Jennifer Sleeman , Jay Brett , Caroline Tang , Jenelle Millison , Anand Gnanadesikan This is my paper

Pith reviewed 2026-05-21 07:11 UTC · model grok-4.3

classification 💻 cs.LG

keywords deep learning surrogateclimate tipping dynamicsocean transporttemporal fusion transformerstochastic uncertaintyAtlantic collapsePacific collapsecomputational speedup

0 comments

The pith

A modified Temporal Fusion Transformer emulates stochastic ocean tipping with 465x speedup while preserving differentiability.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that a dynamics-informed Temporal Fusion Transformer can serve as a data-driven surrogate for expensive numerical simulations of global ocean transport. It processes up to 21 non-stationary time series together with static covariates for parameters and initial conditions. Targeted changes to the architecture and training objective allow the model to predict the timing of Atlantic and Pacific collapses at high fidelity while reproducing the spread of transition times seen in stochastic ensembles. A reader would care because the resulting surrogate runs 465 times faster than the original simulator and supports gradient computations for further analysis or optimization.

Core claim

The central claim is that modifications to the Temporal Fusion Transformer architecture and objective function enable the model to faithfully reproduce the stochastic timing of tipping events from non-stationary multivariate time series of ocean transport. The surrogate anticipates collapse timings to high fidelity, captures uncertainty across ensemble predictions, achieves a 465x computational speedup over the numerical simulator, and remains differentiable with respect to parameters and initial conditions.

What carries the argument

The dynamics-informed Temporal Fusion Transformer with modifications to architecture and objective function for predicting tipping from multivariate non-stationary time series.

If this is right

Large-scale ensemble runs become feasible for quantifying uncertainty in collapse timing.
Gradient-based optimization and sensitivity analysis can now be performed directly on tipping behavior.
Similar surrogates could accelerate exploration of parameter spaces in other Earth system models.
Forecasts of tipping events can extend across thousands of time steps at reduced computational cost.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Differentiability could support inverse modeling to infer unobserved initial conditions from real ocean observations.
The approach might extend to coupled tipping elements, such as interactions between ocean circulation and ice sheets.
Real-time early-warning systems for climate shifts could incorporate such lightweight surrogates for continuous monitoring.

Load-bearing premise

The modifications to the TFT architecture and objective function are sufficient to faithfully reproduce the stochastic timing of tipping events from the underlying non-stationary multivariate time series without introducing systematic bias in the predicted collapse distributions.

What would settle it

Generate a new ensemble of collapse times from the original numerical simulator on held-out initial conditions and parameters, then compare the full distribution of predicted timings and variances against those produced by the surrogate.

Figures

Figures reproduced from arXiv: 2605.20580 by Adeline Hillier, Anand Gnanadesikan, Caroline Tang, Jay Brett, Jenelle Millison, Jennifer Sleeman.

**Figure 2.** Figure 2: (a) Atlantic overturning forecast result for a representative sample from the stochastic [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗

**Figure 3.** Figure 3: Autoregressive rollouts for all predicted variables on a representative test trajectory (median [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗

**Figure 4.** Figure 4: Two examples (top, bottom) demonstrating ensemble fidelity of surrogate forecasts for [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗

**Figure 5.** Figure 5: Parity plots of predicted versus true transition timing for all 389 Atlantic collapse examples [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗

read the original abstract

This work explores a dynamics-informed Temporal Fusion Transformer (TFT) as a data-driven surrogate for computationally intensive Earth system simulations. Focusing on multivariate time series describing global ocean transport, we demonstrate the surrogate's ability to forecast tip events across thousands of time steps. The data involve up to 21 non-stationary time series in addition to static covariates describing free parameters and initial conditions. Modifications to the architecture and objective function yield a surrogate that anticipates the timing of Atlantic and Pacific collapses to high fidelity and captures the stochastic uncertainty in transition timing across ensemble predictions. The learned surrogate achieves a 465x computational speedup over the numerical simulator while maintaining differentiability with respect to parameters and initial conditions.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

TFT surrogate for stochastic ocean tipping gives 465x speedup and differentiability but abstract leaves the fidelity on timing distributions and uncertainty unproven.

read the letter

Hey, the main thing here is a modified Temporal Fusion Transformer trained as a surrogate on ocean transport simulations that hits a 465x speedup over the full model while staying differentiable with respect to parameters and initial conditions. They use it to predict timing of Atlantic and Pacific collapses and to capture the spread in those times across ensembles from up to 21 non-stationary series plus static covariates. That combination for stochastic tipping looks like the actual new piece compared with earlier deterministic climate emulators. The practical upside is clear: the speed lets people run much larger ensembles for risk work, and the differentiability supports sensitivity checks without rerunning the simulator every time. Training directly on simulator outputs also keeps the circularity low. The soft spot sits in the validation. The abstract claims high-fidelity timing and uncertainty capture, but it gives no error metrics, no validation-split details, and no direct comparison of the predicted collapse distributions to the original stochastic runs. If the architecture tweaks or custom loss are shifting the heavy-tailed transition statistics, that would matter for downstream use even if point predictions look decent. The full paper and any released code would need to show held-out ensemble tests to settle this. This is for climate modelers and integrated-assessment people who need faster proxies for tipping probabilities. A reader already running stochastic ensembles would see the most immediate value. It deserves a serious referee because the speedup claim is concrete and the application addresses a real bottleneck, even if the review will focus on tighter quantitative checks for bias in the timing distributions. I'd send it out for review rather than desk reject.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes a dynamics-informed Temporal Fusion Transformer (TFT) as a data-driven surrogate for emulating stochastic tipping dynamics in multivariate time series from global ocean transport Earth system simulations. It claims to forecast Atlantic and Pacific collapse events over thousands of time steps with high fidelity, capture stochastic uncertainty in transition timing across ensembles, achieve a 465x computational speedup, and maintain differentiability with respect to parameters and initial conditions.

Significance. If the central claims regarding fidelity and unbiased uncertainty capture are substantiated, the work could enable much faster ensemble simulations and sensitivity analyses for climate tipping points, which are otherwise computationally prohibitive. The differentiability and speedup are notable strengths that could support integration into optimization or adjoint-based methods in Earth system modeling.

major comments (2)

[Abstract] Abstract: The abstract asserts 'high fidelity' timing predictions and capture of 'stochastic uncertainty in transition timing' without providing any quantitative metrics (e.g., MAE or RMSE on collapse times, statistical distances between predicted and simulated timing distributions, or ensemble coverage rates). This absence is load-bearing for the central claim of faithful emulation of the underlying stochastic dynamics.
[Methods] Methods (TFT modifications and objective function): The architecture changes (attention, static covariates for parameters/initial conditions) and custom loss are presented as sufficient to reproduce non-stationary stochastic timing from the 21-variable series without systematic bias in collapse distributions. However, no explicit validation (e.g., Kolmogorov-Smirnov tests, quantile comparisons, or ablation on the loss weighting) is described to rule out distortion of heavy-tailed or non-stationary transition statistics, which directly risks the weakest assumption identified in the stress-test note.

minor comments (1)

[Abstract] The 465x speedup figure would be more informative if accompanied by details on the hardware configuration used for both the original simulator and surrogate inference.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed comments on our manuscript. We address each major comment below and outline the revisions we will make to strengthen the presentation of our results while preserving the core contributions.

read point-by-point responses

Referee: [Abstract] Abstract: The abstract asserts 'high fidelity' timing predictions and capture of 'stochastic uncertainty in transition timing' without providing any quantitative metrics (e.g., MAE or RMSE on collapse times, statistical distances between predicted and simulated timing distributions, or ensemble coverage rates). This absence is load-bearing for the central claim of faithful emulation of the underlying stochastic dynamics.

Authors: We agree that the abstract would be strengthened by the inclusion of specific quantitative metrics supporting the claims of high-fidelity timing predictions and stochastic uncertainty capture. In the revised manuscript we will add concise numerical results to the abstract, including MAE and RMSE on predicted collapse times relative to the simulator, as well as a statistical distance (e.g., Wasserstein or Kolmogorov-Smirnov) between the predicted and simulated timing distributions across ensembles. These additions will make the central claims directly quantifiable without altering the abstract's length or focus. revision: yes
Referee: [Methods] Methods (TFT modifications and objective function): The architecture changes (attention, static covariates for parameters/initial conditions) and custom loss are presented as sufficient to reproduce non-stationary stochastic timing from the 21-variable series without systematic bias in collapse distributions. However, no explicit validation (e.g., Kolmogorov-Smirnov tests, quantile comparisons, or ablation on the loss weighting) is described to rule out distortion of heavy-tailed or non-stationary transition statistics, which directly risks the weakest assumption identified in the stress-test note.

Authors: We acknowledge that additional explicit statistical validations would help readers confirm that the modified TFT and custom loss faithfully reproduce the non-stationary and potentially heavy-tailed transition statistics. While the manuscript already reports ensemble-level timing distributions and qualitative agreement with the simulator, we will add in the revised Methods and Results sections: (i) Kolmogorov-Smirnov tests and quantile-quantile comparisons between predicted and simulated collapse-time distributions, (ii) an ablation study isolating the contribution of the custom loss weighting, and (iii) a brief discussion clarifying how these checks address potential distortions in non-stationary statistics. We believe these additions directly mitigate the concern without requiring new experiments. revision: yes

Circularity Check

0 steps flagged

No significant circularity: empirical surrogate trained on external simulation outputs

full rationale

The paper presents a data-driven Temporal Fusion Transformer surrogate trained on outputs from a numerical Earth system simulator. Performance claims (speedup, fidelity on collapse timing, uncertainty capture) are evaluated against held-out simulation ensembles rather than reducing to the model's own fitted parameters or self-citations by construction. No load-bearing self-citation chains, uniqueness theorems, or ansatzes imported from prior author work appear in the provided abstract or description; the central results remain falsifiable against independent simulator runs.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on the assumption that a neural network trained on simulation trajectories can generalize the stochastic dynamics of tipping events; the model weights themselves are free parameters learned from data.

free parameters (1)

TFT architecture hyperparameters and loss weighting
Modifications to the architecture and objective function are chosen to achieve the reported fidelity; these choices are fitted or tuned on the training ensembles.

axioms (1)

domain assumption Neural networks with attention mechanisms can approximate the mapping from multivariate non-stationary time series plus static covariates to future tipping-event distributions.
Invoked implicitly by training the surrogate to forecast tip events across thousands of time steps.

pith-pipeline@v0.9.0 · 5653 in / 1421 out tokens · 41682 ms · 2026-05-21T07:11:59.197716+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Modifications to the architecture and objective function yield a surrogate that anticipates the timing of Atlantic and Pacific collapses to high fidelity and captures the stochastic uncertainty in transition timing across ensemble predictions.

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

59 extracted references · 59 canonical work pages · 3 internal anchors

[1]

Overshooting tipping point thresholds in a changing climate

Paul DL Ritchie et al. “Overshooting tipping point thresholds in a changing climate”. In: Nature592.7855 (2021), pp. 517–523

work page 2021
[2]

Critical slowing down indicators

Fahimeh Nazarimehr et al. “Critical slowing down indicators”. In:Europhysics Letters132.1 (2020), p. 18001

work page 2020
[4]

Early warning of climate tipping points

Timothy M Lenton. “Early warning of climate tipping points”. In:Nature climate change1.4 (2011), pp. 201–209

work page 2011
[5]

Atlantic pycnocline theory scrutinized using a coupled climate model

Anders Levermann and Johannes Jakob Fürst. “Atlantic pycnocline theory scrutinized using a coupled climate model”. In:Geophysical research letters37.14 (2010)

work page 2010
[6]

Jennifer Sleeman et al.Using Artificial Intelligence to aid Scientific Discovery of Climate Tipping Points. Feb. 2023.DOI:10.48550/arXiv.2302.06852

work page doi:10.48550/arxiv.2302.06852 2023
[7]

Jennifer Sleeman et al.A Generative Adversarial Network for Climate Tipping Point Discovery (TIP-GAN). Feb. 2023.DOI:10.48550/arXiv.2302.10274

work page doi:10.48550/arxiv.2302.10274 2023
[8]

Tipping point detection and early warnings in climate, ecological, and human systems

Vasilis Dakos et al. “Tipping point detection and early warnings in climate, ecological, and human systems”. In:Earth System Dynamics15.4 (2024), pp. 1117–1135

work page 2024
[9]

Soft-DTW: a differentiable loss function for time-series

Marco Cuturi and Mathieu Blondel. “Soft-DTW: a differentiable loss function for time-series”. In:Proceedings of the 34th International Conference on Machine Learning - Volume 70. ICML’17. Sydney, NSW, Australia: JMLR.org, 2017, pp. 894–903

work page 2017
[10]

Thermohaline convection with two stable regimes of flow

Henry Stommel. “Thermohaline convection with two stable regimes of flow”. In:Tellus13.2 (1961), pp. 224–230

work page 1961
[11]

Shutdown and recovery of the AMOC in a coupled global climate model: the role of the advective feedback

LC Jackson. “Shutdown and recovery of the AMOC in a coupled global climate model: the role of the advective feedback”. In:Geophysical Research Letters40.6 (2013), pp. 1182–1188

work page 2013
[12]

The effect of Indian Ocean surface freshwater flux biases on the multi-stable regime of the AMOC

Henk A Dijkstra and René M van Westen. “The effect of Indian Ocean surface freshwater flux biases on the multi-stable regime of the AMOC”. In:Tellus, Series A: Dynamic Meteorology and Oceanography76.1 (2024), pp. 90–100

work page 2024
[13]

Closure of the meridional overturning circulation through Southern Ocean upwelling

John Marshall and Kevin Speer. “Closure of the meridional overturning circulation through Southern Ocean upwelling”. In:Nature geoscience5.3 (2012), pp. 171–180

work page 2012
[14]

Tipping Points in Overturning Circulation Mediated by Ocean Mixing and the Configuration and Magnitude of the Hydrological Cycle: A Simple Model

Anand Gnanadesikan et al. “Tipping Points in Overturning Circulation Mediated by Ocean Mixing and the Configuration and Magnitude of the Hydrological Cycle: A Simple Model”. In:Journal of Physical Oceanography54 (Apr. 2024).DOI:10.1175/JPO-D-23-0161.1

work page doi:10.1175/jpo-d-23-0161.1 2024
[15]

Flux correction and overturning stability: Insights from a dynamical box model

Anand Gnanadesikan, Richard Kelson, and Michaela Sten. “Flux correction and overturning stability: Insights from a dynamical box model”. In:Journal of Climate31.22 (2018), pp. 9335– 9350

work page 2018
[16]

An embedded genus-one helicoid.Pro- ceedings of the National Academy of Sciences, 102(46):16566–16568, 2005.doi:10.1073/pnas

Thomas M. Bury et al. “Deep learning for early warning signals of tipping points”. In: Proceedings of the National Academy of Sciences118.39 (2021), e2106140118.DOI: 10. 1073/pnas.2106140118 . eprint: https://www.pnas.org/doi/pdf/10.1073/pnas. 2106140118.URL:https://www.pnas.org/doi/abs/10.1073/pnas.2106140118

work page doi:10.1073/pnas 2021
[17]

Machine learning methods trained on simple models can predict critical tran- sitions in complex natural systems

Smita Deb et al. “Machine learning methods trained on simple models can predict critical tran- sitions in complex natural systems”. In:Royal Society Open Science9 (Feb. 2022), p. 211475. DOI:10.1098/rsos.211475

work page doi:10.1098/rsos.211475 2022
[18]

Early Predictor for the Onset of Critical Transitions in Networked Dynamical Systems

Zijia Liu et al. “Early Predictor for the Onset of Critical Transitions in Networked Dynamical Systems”. In:Physical Review X14 (July 2024).DOI:10.1103/PhysRevX.14.031009

work page doi:10.1103/physrevx.14.031009 2024
[19]

Towards neural Earth system modelling by integrating artificial intelligence in Earth system science

Christopher Irrgang et al. “Towards neural Earth system modelling by integrating artificial intelligence in Earth system science”. In:Nature Machine Intelligence3 (Aug. 2021), pp. 667– 674.DOI:10.1038/s42256-021-00374-3

work page doi:10.1038/s42256-021-00374-3 2021
[20]

Bridging Idealized and Operational Models: An Explainable AI Frame- work for Earth System Emulators

A. Behnoudfar et al. “Bridging Idealized and Operational Models: An Explainable AI Frame- work for Earth System Emulators”. In:arXiv preprint arXiv:2510.13030(2025).URL: https: //arxiv.org/abs/2510.13030

work page arXiv 2025
[21]

Scalable Spatiotemporal Graph Neural Networks

Andrea Cini et al. “Scalable Spatiotemporal Graph Neural Networks”. In:Proceedings of the AAAI Conference on Artificial Intelligence37 (June 2023), pp. 7218–7226.DOI: 10.1609/ aaai.v37i6.25880

work page 2023
[22]

METRO: a generic graph neural network framework for multivariate time series forecasting

Yue Cui et al. “METRO: a generic graph neural network framework for multivariate time series forecasting”. In:Proceedings of the VLDB Endowment15 (Feb. 2022), pp. 224–236. DOI:10.14778/3489496.3489503

work page doi:10.14778/3489496.3489503 2022
[23]

Multi-channel fusion graph neural network for multivariate time series forecasting

Yanzhe Chen and Zongxia Xie. “Multi-channel fusion graph neural network for multivariate time series forecasting”. In:Journal of Computational Science64 (2022), p. 101862.ISSN: 1877-7503.DOI: https : / / doi . org / 10 . 1016 / j . jocs . 2022 . 101862.URL: https : //www.sciencedirect.com/science/article/pii/S1877750322002216. 11

work page 2022
[26]

Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks

Zonghan Wu et al. “Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks”. In:Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. KDD ’20. Virtual Event, CA, USA: Association for Computing Machinery, 2020, pp. 753–763.ISBN: 9781450379984.DOI: 10.1145/3394486. 3403118.URL:https://...

work page doi:10.1145/3394486 2020
[27]

2021.DOI:10.48550/arXiv.2104.14917

Fuxian Li et al.Dynamic Graph Convolutional Recurrent Network for Traffic Prediction: Benchmark and Solution. 2021.DOI:10.48550/arXiv.2104.14917

work page doi:10.48550/arxiv.2104.14917 2021
[28]

Dynamic Causal Explanation Based Diffusion-Variational Graph Neural Network for Spatiotemporal Forecasting

Guojun Liang et al. “Dynamic Causal Explanation Based Diffusion-Variational Graph Neural Network for Spatiotemporal Forecasting”. In:IEEE transactions on neural networks and learning systemsPP (July 2024).DOI:10.1109/TNNLS.2024.3415149

work page doi:10.1109/tnnls.2024.3415149 2024
[29]

June 2022.DOI:10.48550/arXiv.2206.13816

Junchen Ye et al.Learning the Evolutionary and Multi-scale Graph Structure for Multivariate Time Series Forecasting. June 2022.DOI:10.48550/arXiv.2206.13816

work page doi:10.48550/arxiv.2206.13816 2022
[30]

Causal Discovery and Forecasting in Nonstationary Environments with State-Space Models

Biwei Huang et al. “Causal Discovery and Forecasting in Nonstationary Environments with State-Space Models”. In:Proceedings of the 36th International Conference on Machine Learning (ICML). V ol. 97. PMLR, 2019

work page 2019
[31]

Higher-order Granger reservoir computing: Simultaneously achieving scalable complex structures inference and accurate dynamics prediction

Xin Li et al. “Higher-order Granger reservoir computing: Simultaneously achieving scalable complex structures inference and accurate dynamics prediction”. In:Nature Communications 15 (Mar. 2024), p. 2506.DOI:10.1038/s41467-024-46852-1

work page doi:10.1038/s41467-024-46852-1 2024
[32]

Thomas Kipf et al.Neural Relational Inference for Interacting Systems. Feb. 2018.DOI: 10.48550/arXiv.1802.04687

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1802.04687 2018
[33]

Discrete Graph Structure Learning for Forecasting Multi- ple Time Series

Chao Shang, Jie Chen, and Jinbo Bi. “Discrete Graph Structure Learning for Forecasting Multi- ple Time Series”. In:ArXivabs/2101.06861 (2021).URL: https://api.semanticscholar. org/CorpusID:231632580

work page arXiv 2021
[34]

July 2020.DOI:10.48550/arXiv.2007.00267

Elena Saggioro et al.Reconstructing regime-dependent causal relationships from observational time series. July 2020.DOI:10.48550/arXiv.2007.00267

work page doi:10.48550/arxiv.2007.00267 2020
[35]

Causal structure learning for high-dimensional non-stationary time series

Siya Chen, HaoTian Wu, and Guang Jin. “Causal structure learning for high-dimensional non-stationary time series”. In:Knowledge-Based Systems295 (July 2024), p. 111868.DOI: 10.1016/j.knosys.2024.111868

work page doi:10.1016/j.knosys.2024.111868 2024
[36]

doi: 10.1609/aaai.v35i12.17325

Haoyi Zhou et al. “Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting”. In:Proceedings of the AAAI Conference on Artificial Intelligence35 (May 2021), pp. 11106–11115.DOI:10.1609/aaai.v35i12.17325

work page doi:10.1609/aaai.v35i12.17325 2021
[37]

FEDformer: Frequency Enhanced Decomposed Transformer for Long- term Series Forecasting

Tian Zhou et al. “FEDformer: Frequency Enhanced Decomposed Transformer for Long- term Series Forecasting”. In:International Conference on Machine Learning. PMLR. 2022, pp. 27268–27286

work page 2022
[38]

Are Transformers Effective for Time Series Forecasting?

Ailing Zeng et al. “Are Transformers Effective for Time Series Forecasting?” In:Proceedings of the AAAI Conference on Artificial Intelligence37 (June 2023), pp. 11121–11128.DOI: 10.1609/aaai.v37i9.26317

work page doi:10.1609/aaai.v37i9.26317 2023
[39]

Model scale versus domain knowledge in statistical forecasting of chaotic systems

William Gilpin. “Model scale versus domain knowledge in statistical forecasting of chaotic systems”. In:Physical Review Research(2023).URL: https://api.semanticscholar. org/CorpusID:258352864

work page 2023
[40]

Robustness of LSTM neural networks for multi-step forecasting of chaotic time series

Matteo Sangiorgio and Fabio Dercole. “Robustness of LSTM neural networks for multi-step forecasting of chaotic time series”. In:Chaos, Solitons & Fractals139 (Oct. 2020), p. 110045. DOI:10.1016/j.chaos.2020.110045

work page doi:10.1016/j.chaos.2020.110045 2020
[41]

Inferring the dynamics of oscillatory systems using recurrent neural networks

Rok Cestnik and Markus Abel. “Inferring the dynamics of oscillatory systems using recurrent neural networks”. In:Chaos: An Interdisciplinary Journal of Nonlinear Science29 (June 2019), p. 063128.DOI:10.1063/1.5096918. 12

work page doi:10.1063/1.5096918 2019
[42]

Mikhailov

Gonzalo Uribarri and Gabriel Mindlin. “Dynamical time series embeddings in recurrent neural networks”. In:Chaos, Solitons & Fractals154 (Nov. 2021), p. 111612.DOI: 10.1016/j. chaos.2021.111612

work page doi:10.1016/j 2021
[43]

Model-Free Prediction of Large Spatiotemporally Chaotic Systems from Data: A Reservoir Computing Approach

Jaideep Pathak et al. “Model-Free Prediction of Large Spatiotemporally Chaotic Systems from Data: A Reservoir Computing Approach”. In:Physical Review Letters120 (Jan. 2018).DOI: 10.1103/PhysRevLett.120.024102

work page doi:10.1103/physrevlett.120.024102 2018
[44]

Deep learning for predicting the occurrence of tipping points

Chengzuo Zhuge, Jiawei Li, and Wei Chen. “Deep learning for predicting the occurrence of tipping points”. In:Royal Society Open Science12.7 (2025), p. 242240.DOI: 10.1098/ rsos.242240 . eprint: https://royalsocietypublishing.org/doi/pdf/10.1098/ rsos . 242240.URL: https : / / royalsocietypublishing . org / doi / abs / 10 . 1098 / rsos.242240

work page 2025
[45]

Extrapolating tipping points and simulating non-stationary dynamics of complex systems using efficient machine learning

Daniel Köglmayr and C. Räth. “Extrapolating tipping points and simulating non-stationary dynamics of complex systems using efficient machine learning”. In:Scientific Reports14 (Jan. 2024).DOI:10.1038/s41598-023-50726-9

work page doi:10.1038/s41598-023-50726-9 2024
[46]

Adaptable reservoir computing: A paradigm for model- free data-driven prediction of critical transitions in nonlinear dynamical systems

Shirin Panahi and Ying-Cheng Lai. “Adaptable reservoir computing: A paradigm for model- free data-driven prediction of critical transitions in nonlinear dynamical systems”. In:Chaos: An Interdisciplinary Journal of Nonlinear Science34 (May 2024).DOI: 10.1063/5.0200898

work page doi:10.1063/5.0200898 2024
[47]

Journal of Mathematical Physics64(9), 091902 (2023) https://doi.org/10.1063/5

Dhruvit Patel and Edward Ott. “Using machine learning to anticipate tipping points and extrapolate to post-tipping dynamics of non-stationary dynamical systems”. In:Chaos: An Interdisciplinary Journal of Nonlinear Science33 (Feb. 2023), p. 023143.DOI: 10.1063/5. 0131787

work page doi:10.1063/5 2023
[48]

DyAt Nets: Dynamic Attention Networks for State Forecasting in Cyber-Physical Systems

Nikhil Muralidhar, Sathappan Muthiah, and Naren Ramakrishnan. “DyAt Nets: Dynamic Attention Networks for State Forecasting in Cyber-Physical Systems”. In:Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI-19. 2019, pp. 3180–3186.DOI:10.24963/ijcai.2019/441

work page doi:10.24963/ijcai.2019/441 2019
[49]

Deep learning-based time series forecasting

Xiaobao Song et al. “Deep learning-based time series forecasting”. In:Artificial Intelligence Review58 (Nov. 2024).DOI:10.1007/s10462-024-10989-8

work page doi:10.1007/s10462-024-10989-8 2024
[50]

A Multiattention-Based Supervised Feature Selection Method for Multivariate Time Series

Li Cao et al. “A Multiattention-Based Supervised Feature Selection Method for Multivariate Time Series”. In:Computational intelligence and neuroscience(July 2021).DOI: 10.1155/ 2021/6911192

work page 2021
[51]

Multidimensional dynamic attention for multivariate time series forecasting

Sarah Almaghrabi et al. “Multidimensional dynamic attention for multivariate time series forecasting”. In:Applied Soft Computing167 (Oct. 2024), p. 112350.DOI: 10.1016/j.asoc. 2024.112350

work page doi:10.1016/j.asoc 2024
[52]

A dual-stage attention-based recurrent neural network for time series prediction

Yao Qin et al. “A dual-stage attention-based recurrent neural network for time series prediction”. In:Proceedings of the 26th International Joint Conference on Artificial Intelligence. IJCAI’17. Melbourne, Australia: AAAI Press, 2017, pp. 2627–2633.ISBN: 9780999241103

work page 2017
[53]

Simba: Simplified mamba-based architecture for vision and multivariate time series

Badri Narayana Patro and Vijay Agneeswaran. “SiMBA: Simplified Mamba-Based Archi- tecture for Vision and Multivariate Time series”. In:ArXivabs/2403.15360 (2024).URL: https://api.semanticscholar.org/CorpusID:268666944

work page arXiv 2024
[54]

CMMamba: channel mixing Mamba for time series forecasting

Qiang Li et al. “CMMamba: channel mixing Mamba for time series forecasting”. In:Journal of Big Data11 (Oct. 2024).DOI:10.1186/s40537-024-01001-9

work page doi:10.1186/s40537-024-01001-9 2024
[55]

Temporal Fusion Transformers for interpretable multi-horizon time series forecasting

Bryan Lim et al. “Temporal Fusion Transformers for interpretable multi-horizon time series forecasting”. In:International Journal of Forecasting37 (June 2021).DOI: 10 . 1016 / j . ijforecast.2021.03.012

work page 2021
[56]

SegRNN: Segment Recurrent Neural Network for Long-Term Time- Series Forecasting

Shengsheng Lin et al. “SegRNN: Segment Recurrent Neural Network for Long-Term Time- Series Forecasting”. In:IEEE Internet of Things Journal13 (2023), pp. 9861–9871.URL: https://api.semanticscholar.org/CorpusID:261064627

work page 2023
[57]

iTransformer: Inverted Transformers Are Effective for Time Series Forecasting

Yong Liu et al. “iTransformer: Inverted Transformers Are Effective for Time Series Forecast- ing”. In:arXiv preprint arXiv:2310.06625(2023)

work page internal anchor Pith review Pith/arXiv arXiv 2023
[58]

Long-term forecasting with tide: Time-series dense encoder,

Abhimanyu Das et al. “Long-term Forecasting with TiDE: Time-series Dense Encoder”. In: ArXivabs/2304.08424 (2023).URL:https://arxiv.org/abs/2304.08424

work page arXiv 2023
[59]

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Albert Gu and Tri Dao. “Mamba: Linear-Time Sequence Modeling with Selective State Spaces”. In:ArXivabs/2312.00752 (2023).URL: https://api.semanticscholar.org/ CorpusID:265551773. 13

work page internal anchor Pith review Pith/arXiv arXiv 2023
[60]

Timexer: Empowering transformers for time series forecasting with exogenous variables

Yuxuan Wang et al. “Timexer: Empowering transformers for time series forecasting with exogenous variables”. In:Advances in Neural Information Processing Systems(2024)

work page 2024
[61]

Kolle et al.Climate data (30min resolution) from the climate station in The Jena Experiment

O. Kolle et al.Climate data (30min resolution) from the climate station in The Jena Experiment. 2021.URL:https://jexis.idiv.de/

work page 2021
[62]

Defining and Measuring Chaos

George Datseris and Ulrich Parlitz. “Defining and Measuring Chaos”. In:Nonlinear Dynamics: A Concise Introduction Interlaced with Code. Cham: Springer International Publishing, 2022, pp. 37–52.ISBN: 978-3-030-91032-7.DOI: 10 . 1007 / 978 - 3 - 030 - 91032 - 7 _ 3.URL: https://doi.org/10.1007/978-3-030-91032-7_3. 14

work page doi:10.1007/978-3-030-91032-7_3 2022

[1] [1]

Overshooting tipping point thresholds in a changing climate

Paul DL Ritchie et al. “Overshooting tipping point thresholds in a changing climate”. In: Nature592.7855 (2021), pp. 517–523

work page 2021

[2] [2]

Critical slowing down indicators

Fahimeh Nazarimehr et al. “Critical slowing down indicators”. In:Europhysics Letters132.1 (2020), p. 18001

work page 2020

[3] [4]

Early warning of climate tipping points

Timothy M Lenton. “Early warning of climate tipping points”. In:Nature climate change1.4 (2011), pp. 201–209

work page 2011

[4] [5]

Atlantic pycnocline theory scrutinized using a coupled climate model

Anders Levermann and Johannes Jakob Fürst. “Atlantic pycnocline theory scrutinized using a coupled climate model”. In:Geophysical research letters37.14 (2010)

work page 2010

[5] [6]

Jennifer Sleeman et al.Using Artificial Intelligence to aid Scientific Discovery of Climate Tipping Points. Feb. 2023.DOI:10.48550/arXiv.2302.06852

work page doi:10.48550/arxiv.2302.06852 2023

[6] [7]

Jennifer Sleeman et al.A Generative Adversarial Network for Climate Tipping Point Discovery (TIP-GAN). Feb. 2023.DOI:10.48550/arXiv.2302.10274

work page doi:10.48550/arxiv.2302.10274 2023

[7] [8]

Tipping point detection and early warnings in climate, ecological, and human systems

Vasilis Dakos et al. “Tipping point detection and early warnings in climate, ecological, and human systems”. In:Earth System Dynamics15.4 (2024), pp. 1117–1135

work page 2024

[8] [9]

Soft-DTW: a differentiable loss function for time-series

Marco Cuturi and Mathieu Blondel. “Soft-DTW: a differentiable loss function for time-series”. In:Proceedings of the 34th International Conference on Machine Learning - Volume 70. ICML’17. Sydney, NSW, Australia: JMLR.org, 2017, pp. 894–903

work page 2017

[9] [10]

Thermohaline convection with two stable regimes of flow

Henry Stommel. “Thermohaline convection with two stable regimes of flow”. In:Tellus13.2 (1961), pp. 224–230

work page 1961

[10] [11]

Shutdown and recovery of the AMOC in a coupled global climate model: the role of the advective feedback

LC Jackson. “Shutdown and recovery of the AMOC in a coupled global climate model: the role of the advective feedback”. In:Geophysical Research Letters40.6 (2013), pp. 1182–1188

work page 2013

[11] [12]

The effect of Indian Ocean surface freshwater flux biases on the multi-stable regime of the AMOC

Henk A Dijkstra and René M van Westen. “The effect of Indian Ocean surface freshwater flux biases on the multi-stable regime of the AMOC”. In:Tellus, Series A: Dynamic Meteorology and Oceanography76.1 (2024), pp. 90–100

work page 2024

[12] [13]

Closure of the meridional overturning circulation through Southern Ocean upwelling

John Marshall and Kevin Speer. “Closure of the meridional overturning circulation through Southern Ocean upwelling”. In:Nature geoscience5.3 (2012), pp. 171–180

work page 2012

[13] [14]

Tipping Points in Overturning Circulation Mediated by Ocean Mixing and the Configuration and Magnitude of the Hydrological Cycle: A Simple Model

Anand Gnanadesikan et al. “Tipping Points in Overturning Circulation Mediated by Ocean Mixing and the Configuration and Magnitude of the Hydrological Cycle: A Simple Model”. In:Journal of Physical Oceanography54 (Apr. 2024).DOI:10.1175/JPO-D-23-0161.1

work page doi:10.1175/jpo-d-23-0161.1 2024

[14] [15]

Flux correction and overturning stability: Insights from a dynamical box model

Anand Gnanadesikan, Richard Kelson, and Michaela Sten. “Flux correction and overturning stability: Insights from a dynamical box model”. In:Journal of Climate31.22 (2018), pp. 9335– 9350

work page 2018

[15] [16]

An embedded genus-one helicoid.Pro- ceedings of the National Academy of Sciences, 102(46):16566–16568, 2005.doi:10.1073/pnas

Thomas M. Bury et al. “Deep learning for early warning signals of tipping points”. In: Proceedings of the National Academy of Sciences118.39 (2021), e2106140118.DOI: 10. 1073/pnas.2106140118 . eprint: https://www.pnas.org/doi/pdf/10.1073/pnas. 2106140118.URL:https://www.pnas.org/doi/abs/10.1073/pnas.2106140118

work page doi:10.1073/pnas 2021

[16] [17]

Machine learning methods trained on simple models can predict critical tran- sitions in complex natural systems

Smita Deb et al. “Machine learning methods trained on simple models can predict critical tran- sitions in complex natural systems”. In:Royal Society Open Science9 (Feb. 2022), p. 211475. DOI:10.1098/rsos.211475

work page doi:10.1098/rsos.211475 2022

[17] [18]

Early Predictor for the Onset of Critical Transitions in Networked Dynamical Systems

Zijia Liu et al. “Early Predictor for the Onset of Critical Transitions in Networked Dynamical Systems”. In:Physical Review X14 (July 2024).DOI:10.1103/PhysRevX.14.031009

work page doi:10.1103/physrevx.14.031009 2024

[18] [19]

Towards neural Earth system modelling by integrating artificial intelligence in Earth system science

Christopher Irrgang et al. “Towards neural Earth system modelling by integrating artificial intelligence in Earth system science”. In:Nature Machine Intelligence3 (Aug. 2021), pp. 667– 674.DOI:10.1038/s42256-021-00374-3

work page doi:10.1038/s42256-021-00374-3 2021

[19] [20]

Bridging Idealized and Operational Models: An Explainable AI Frame- work for Earth System Emulators

A. Behnoudfar et al. “Bridging Idealized and Operational Models: An Explainable AI Frame- work for Earth System Emulators”. In:arXiv preprint arXiv:2510.13030(2025).URL: https: //arxiv.org/abs/2510.13030

work page arXiv 2025

[20] [21]

Scalable Spatiotemporal Graph Neural Networks

Andrea Cini et al. “Scalable Spatiotemporal Graph Neural Networks”. In:Proceedings of the AAAI Conference on Artificial Intelligence37 (June 2023), pp. 7218–7226.DOI: 10.1609/ aaai.v37i6.25880

work page 2023

[21] [22]

METRO: a generic graph neural network framework for multivariate time series forecasting

Yue Cui et al. “METRO: a generic graph neural network framework for multivariate time series forecasting”. In:Proceedings of the VLDB Endowment15 (Feb. 2022), pp. 224–236. DOI:10.14778/3489496.3489503

work page doi:10.14778/3489496.3489503 2022

[22] [23]

Multi-channel fusion graph neural network for multivariate time series forecasting

Yanzhe Chen and Zongxia Xie. “Multi-channel fusion graph neural network for multivariate time series forecasting”. In:Journal of Computational Science64 (2022), p. 101862.ISSN: 1877-7503.DOI: https : / / doi . org / 10 . 1016 / j . jocs . 2022 . 101862.URL: https : //www.sciencedirect.com/science/article/pii/S1877750322002216. 11

work page 2022

[23] [26]

Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks

Zonghan Wu et al. “Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks”. In:Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. KDD ’20. Virtual Event, CA, USA: Association for Computing Machinery, 2020, pp. 753–763.ISBN: 9781450379984.DOI: 10.1145/3394486. 3403118.URL:https://...

work page doi:10.1145/3394486 2020

[24] [27]

2021.DOI:10.48550/arXiv.2104.14917

Fuxian Li et al.Dynamic Graph Convolutional Recurrent Network for Traffic Prediction: Benchmark and Solution. 2021.DOI:10.48550/arXiv.2104.14917

work page doi:10.48550/arxiv.2104.14917 2021

[25] [28]

Dynamic Causal Explanation Based Diffusion-Variational Graph Neural Network for Spatiotemporal Forecasting

Guojun Liang et al. “Dynamic Causal Explanation Based Diffusion-Variational Graph Neural Network for Spatiotemporal Forecasting”. In:IEEE transactions on neural networks and learning systemsPP (July 2024).DOI:10.1109/TNNLS.2024.3415149

work page doi:10.1109/tnnls.2024.3415149 2024

[26] [29]

June 2022.DOI:10.48550/arXiv.2206.13816

Junchen Ye et al.Learning the Evolutionary and Multi-scale Graph Structure for Multivariate Time Series Forecasting. June 2022.DOI:10.48550/arXiv.2206.13816

work page doi:10.48550/arxiv.2206.13816 2022

[27] [30]

Causal Discovery and Forecasting in Nonstationary Environments with State-Space Models

Biwei Huang et al. “Causal Discovery and Forecasting in Nonstationary Environments with State-Space Models”. In:Proceedings of the 36th International Conference on Machine Learning (ICML). V ol. 97. PMLR, 2019

work page 2019

[28] [31]

Higher-order Granger reservoir computing: Simultaneously achieving scalable complex structures inference and accurate dynamics prediction

Xin Li et al. “Higher-order Granger reservoir computing: Simultaneously achieving scalable complex structures inference and accurate dynamics prediction”. In:Nature Communications 15 (Mar. 2024), p. 2506.DOI:10.1038/s41467-024-46852-1

work page doi:10.1038/s41467-024-46852-1 2024

[29] [32]

Thomas Kipf et al.Neural Relational Inference for Interacting Systems. Feb. 2018.DOI: 10.48550/arXiv.1802.04687

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1802.04687 2018

[30] [33]

Discrete Graph Structure Learning for Forecasting Multi- ple Time Series

Chao Shang, Jie Chen, and Jinbo Bi. “Discrete Graph Structure Learning for Forecasting Multi- ple Time Series”. In:ArXivabs/2101.06861 (2021).URL: https://api.semanticscholar. org/CorpusID:231632580

work page arXiv 2021

[31] [34]

July 2020.DOI:10.48550/arXiv.2007.00267

Elena Saggioro et al.Reconstructing regime-dependent causal relationships from observational time series. July 2020.DOI:10.48550/arXiv.2007.00267

work page doi:10.48550/arxiv.2007.00267 2020

[32] [35]

Causal structure learning for high-dimensional non-stationary time series

Siya Chen, HaoTian Wu, and Guang Jin. “Causal structure learning for high-dimensional non-stationary time series”. In:Knowledge-Based Systems295 (July 2024), p. 111868.DOI: 10.1016/j.knosys.2024.111868

work page doi:10.1016/j.knosys.2024.111868 2024

[33] [36]

doi: 10.1609/aaai.v35i12.17325

Haoyi Zhou et al. “Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting”. In:Proceedings of the AAAI Conference on Artificial Intelligence35 (May 2021), pp. 11106–11115.DOI:10.1609/aaai.v35i12.17325

work page doi:10.1609/aaai.v35i12.17325 2021

[34] [37]

FEDformer: Frequency Enhanced Decomposed Transformer for Long- term Series Forecasting

Tian Zhou et al. “FEDformer: Frequency Enhanced Decomposed Transformer for Long- term Series Forecasting”. In:International Conference on Machine Learning. PMLR. 2022, pp. 27268–27286

work page 2022

[35] [38]

Are Transformers Effective for Time Series Forecasting?

Ailing Zeng et al. “Are Transformers Effective for Time Series Forecasting?” In:Proceedings of the AAAI Conference on Artificial Intelligence37 (June 2023), pp. 11121–11128.DOI: 10.1609/aaai.v37i9.26317

work page doi:10.1609/aaai.v37i9.26317 2023

[36] [39]

Model scale versus domain knowledge in statistical forecasting of chaotic systems

William Gilpin. “Model scale versus domain knowledge in statistical forecasting of chaotic systems”. In:Physical Review Research(2023).URL: https://api.semanticscholar. org/CorpusID:258352864

work page 2023

[37] [40]

Robustness of LSTM neural networks for multi-step forecasting of chaotic time series

Matteo Sangiorgio and Fabio Dercole. “Robustness of LSTM neural networks for multi-step forecasting of chaotic time series”. In:Chaos, Solitons & Fractals139 (Oct. 2020), p. 110045. DOI:10.1016/j.chaos.2020.110045

work page doi:10.1016/j.chaos.2020.110045 2020

[38] [41]

Inferring the dynamics of oscillatory systems using recurrent neural networks

Rok Cestnik and Markus Abel. “Inferring the dynamics of oscillatory systems using recurrent neural networks”. In:Chaos: An Interdisciplinary Journal of Nonlinear Science29 (June 2019), p. 063128.DOI:10.1063/1.5096918. 12

work page doi:10.1063/1.5096918 2019

[39] [42]

Mikhailov

Gonzalo Uribarri and Gabriel Mindlin. “Dynamical time series embeddings in recurrent neural networks”. In:Chaos, Solitons & Fractals154 (Nov. 2021), p. 111612.DOI: 10.1016/j. chaos.2021.111612

work page doi:10.1016/j 2021

[40] [43]

Model-Free Prediction of Large Spatiotemporally Chaotic Systems from Data: A Reservoir Computing Approach

Jaideep Pathak et al. “Model-Free Prediction of Large Spatiotemporally Chaotic Systems from Data: A Reservoir Computing Approach”. In:Physical Review Letters120 (Jan. 2018).DOI: 10.1103/PhysRevLett.120.024102

work page doi:10.1103/physrevlett.120.024102 2018

[41] [44]

Deep learning for predicting the occurrence of tipping points

Chengzuo Zhuge, Jiawei Li, and Wei Chen. “Deep learning for predicting the occurrence of tipping points”. In:Royal Society Open Science12.7 (2025), p. 242240.DOI: 10.1098/ rsos.242240 . eprint: https://royalsocietypublishing.org/doi/pdf/10.1098/ rsos . 242240.URL: https : / / royalsocietypublishing . org / doi / abs / 10 . 1098 / rsos.242240

work page 2025

[42] [45]

Extrapolating tipping points and simulating non-stationary dynamics of complex systems using efficient machine learning

Daniel Köglmayr and C. Räth. “Extrapolating tipping points and simulating non-stationary dynamics of complex systems using efficient machine learning”. In:Scientific Reports14 (Jan. 2024).DOI:10.1038/s41598-023-50726-9

work page doi:10.1038/s41598-023-50726-9 2024

[43] [46]

Adaptable reservoir computing: A paradigm for model- free data-driven prediction of critical transitions in nonlinear dynamical systems

Shirin Panahi and Ying-Cheng Lai. “Adaptable reservoir computing: A paradigm for model- free data-driven prediction of critical transitions in nonlinear dynamical systems”. In:Chaos: An Interdisciplinary Journal of Nonlinear Science34 (May 2024).DOI: 10.1063/5.0200898

work page doi:10.1063/5.0200898 2024

[44] [47]

Journal of Mathematical Physics64(9), 091902 (2023) https://doi.org/10.1063/5

Dhruvit Patel and Edward Ott. “Using machine learning to anticipate tipping points and extrapolate to post-tipping dynamics of non-stationary dynamical systems”. In:Chaos: An Interdisciplinary Journal of Nonlinear Science33 (Feb. 2023), p. 023143.DOI: 10.1063/5. 0131787

work page doi:10.1063/5 2023

[45] [48]

DyAt Nets: Dynamic Attention Networks for State Forecasting in Cyber-Physical Systems

Nikhil Muralidhar, Sathappan Muthiah, and Naren Ramakrishnan. “DyAt Nets: Dynamic Attention Networks for State Forecasting in Cyber-Physical Systems”. In:Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI-19. 2019, pp. 3180–3186.DOI:10.24963/ijcai.2019/441

work page doi:10.24963/ijcai.2019/441 2019

[46] [49]

Deep learning-based time series forecasting

Xiaobao Song et al. “Deep learning-based time series forecasting”. In:Artificial Intelligence Review58 (Nov. 2024).DOI:10.1007/s10462-024-10989-8

work page doi:10.1007/s10462-024-10989-8 2024

[47] [50]

A Multiattention-Based Supervised Feature Selection Method for Multivariate Time Series

Li Cao et al. “A Multiattention-Based Supervised Feature Selection Method for Multivariate Time Series”. In:Computational intelligence and neuroscience(July 2021).DOI: 10.1155/ 2021/6911192

work page 2021

[48] [51]

Multidimensional dynamic attention for multivariate time series forecasting

Sarah Almaghrabi et al. “Multidimensional dynamic attention for multivariate time series forecasting”. In:Applied Soft Computing167 (Oct. 2024), p. 112350.DOI: 10.1016/j.asoc. 2024.112350

work page doi:10.1016/j.asoc 2024

[49] [52]

A dual-stage attention-based recurrent neural network for time series prediction

Yao Qin et al. “A dual-stage attention-based recurrent neural network for time series prediction”. In:Proceedings of the 26th International Joint Conference on Artificial Intelligence. IJCAI’17. Melbourne, Australia: AAAI Press, 2017, pp. 2627–2633.ISBN: 9780999241103

work page 2017

[50] [53]

Simba: Simplified mamba-based architecture for vision and multivariate time series

Badri Narayana Patro and Vijay Agneeswaran. “SiMBA: Simplified Mamba-Based Archi- tecture for Vision and Multivariate Time series”. In:ArXivabs/2403.15360 (2024).URL: https://api.semanticscholar.org/CorpusID:268666944

work page arXiv 2024

[51] [54]

CMMamba: channel mixing Mamba for time series forecasting

Qiang Li et al. “CMMamba: channel mixing Mamba for time series forecasting”. In:Journal of Big Data11 (Oct. 2024).DOI:10.1186/s40537-024-01001-9

work page doi:10.1186/s40537-024-01001-9 2024

[52] [55]

Temporal Fusion Transformers for interpretable multi-horizon time series forecasting

Bryan Lim et al. “Temporal Fusion Transformers for interpretable multi-horizon time series forecasting”. In:International Journal of Forecasting37 (June 2021).DOI: 10 . 1016 / j . ijforecast.2021.03.012

work page 2021

[53] [56]

SegRNN: Segment Recurrent Neural Network for Long-Term Time- Series Forecasting

Shengsheng Lin et al. “SegRNN: Segment Recurrent Neural Network for Long-Term Time- Series Forecasting”. In:IEEE Internet of Things Journal13 (2023), pp. 9861–9871.URL: https://api.semanticscholar.org/CorpusID:261064627

work page 2023

[54] [57]

iTransformer: Inverted Transformers Are Effective for Time Series Forecasting

Yong Liu et al. “iTransformer: Inverted Transformers Are Effective for Time Series Forecast- ing”. In:arXiv preprint arXiv:2310.06625(2023)

work page internal anchor Pith review Pith/arXiv arXiv 2023

[55] [58]

Long-term forecasting with tide: Time-series dense encoder,

Abhimanyu Das et al. “Long-term Forecasting with TiDE: Time-series Dense Encoder”. In: ArXivabs/2304.08424 (2023).URL:https://arxiv.org/abs/2304.08424

work page arXiv 2023

[56] [59]

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Albert Gu and Tri Dao. “Mamba: Linear-Time Sequence Modeling with Selective State Spaces”. In:ArXivabs/2312.00752 (2023).URL: https://api.semanticscholar.org/ CorpusID:265551773. 13

work page internal anchor Pith review Pith/arXiv arXiv 2023

[57] [60]

Timexer: Empowering transformers for time series forecasting with exogenous variables

Yuxuan Wang et al. “Timexer: Empowering transformers for time series forecasting with exogenous variables”. In:Advances in Neural Information Processing Systems(2024)

work page 2024

[58] [61]

Kolle et al.Climate data (30min resolution) from the climate station in The Jena Experiment

O. Kolle et al.Climate data (30min resolution) from the climate station in The Jena Experiment. 2021.URL:https://jexis.idiv.de/

work page 2021

[59] [62]

Defining and Measuring Chaos

George Datseris and Ulrich Parlitz. “Defining and Measuring Chaos”. In:Nonlinear Dynamics: A Concise Introduction Interlaced with Code. Cham: Springer International Publishing, 2022, pp. 37–52.ISBN: 978-3-030-91032-7.DOI: 10 . 1007 / 978 - 3 - 030 - 91032 - 7 _ 3.URL: https://doi.org/10.1007/978-3-030-91032-7_3. 14

work page doi:10.1007/978-3-030-91032-7_3 2022