pith. machine review for the scientific record. sign in

arxiv: 2602.08318 · v2 · submitted 2026-02-09 · 📊 stat.ML · cs.LG· nlin.CD

Recognition: 2 theorem links

· Lean Theorem

Is Flow Matching Just Trajectory Replay for Sequential Data?

Authors on Pith no claims yet

Pith reviewed 2026-05-16 06:19 UTC · model grok-4.3

classification 📊 stat.ML cs.LGnlin.CD
keywords flow matchingtime series forecastingdynamical systemsvelocity fieldnonparametric inferenceODE samplingtrajectory replay
0
0 comments X

The pith

Flow matching on sequential data learns a nonparametric ODE that replays observed transitions via similarity-weighted velocity mixtures.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper derives the exact velocity field that flow matching targets for time series data under perfect approximation. For common Gaussian paths, this field defines an ODE whose dynamics amount to a memory-augmented system that interpolates velocities from similar past transitions. A sympathetic reader cares because the result makes explicit that FM performs a form of continuous trajectory replay rather than abstract dynamical learning, while also yielding a simple closed-form sampler that works without training. This positions trained neural models as approximations to an ideal nonparametric solution.

Core claim

In the limit of perfect function approximation the empirical flow matching objective on sequential data with Gaussian conditional paths targets a velocity field that defines an ODE dynamics equivalent to a nonparametric memory-augmented continuous-time dynamical system; the optimal field admits a closed-form expression as a similarity-weighted mixture of instantaneous velocities induced by observed transitions.

What carries the argument

The similarity-weighted mixture of instantaneous velocities from observed transitions, which carries the argument by making the dataset dependence of the optimal sampler explicit and interpretable.

Load-bearing premise

The analysis assumes the limit of perfect function approximation together with Gaussian conditional paths; if models have limited capacity or non-Gaussian paths are needed, the closed-form characterization may not hold.

What would settle it

Train a high-capacity neural flow matching model on a simple nonlinear dynamical system and compare its generated trajectories to those from the closed-form FreeFM sampler; large systematic differences in the distributions would falsify the claim that the neural model approximates the derived nonparametric solution.

Figures

Figures reproduced from arXiv: 2602.08318 by Michael W. Mahoney, N. Benjamin Erichson, Shizheng Lin, Soon Hoe Lim.

Figure 1
Figure 1. Figure 1: What Dynamical System Are FM Forecasters Actually Sampling From? For sequential data, optimal empirical FM induces a nonparametric, memory-augmented ODE, enabling training-free forecasting by replaying historical transitions. Inspired by this theoretical insight, we propose an ODE sampler dZt dt = GtZt + h(t, Zt ; DM) (see (13)), where the velocity field combines a global linear drift GtZt with a data-adap… view at source ↗
Figure 2
Figure 2. Figure 2: Conditional Forecast. (a) Examples of conditional forecasts generated by FreeFM and baseline models for 20 trajectories from the Aizawa attractor. Each trajectory originates from a different initial condition. (b) sMAPE and VPT of conditional forecast results from FreeFM and baseline models. Shaded regions indicate ±0.5 standard error over 135 dynamical systems, each with 20 trajectories originating from r… view at source ↗
Figure 3
Figure 3. Figure 3: Probabilistic Forecast. (a)-(b) Examples of probabilistic forecast generated by FreeFM and fully trained vanilla flow matching model for time series from Lorenz-63. Error shadows are standard error over 50 Monte-Carlo simulations. (c) sMAPE and CRPS of prob￾abilistic forecast results from FreeFM and fully trained vanilla flow matching model. Error shadows are 0.5 standard error over 135 dynamical systems w… view at source ↗
Figure 4
Figure 4. Figure 4: Long Term Attractor Reconstruction. (a) Correlation dimensions of long term attractor reconstruction result from FreeFM and baseline models. (b) KL divergence of long term attractor reconstruction result from FreeFM and baseline models. Error shadows are 0.5 standard error. Both results are presented over 135 dynamical systems, each has 20 trajectories originated from 20 random initial conditions. The resu… view at source ↗
Figure 5
Figure 5. Figure 5: Illustration of Dynamical Measure Transport via Flow Matching (FM). The schematic depicts the continuous transport of a probability measure from a source to a target distribution. (Left) The process initializes with a standard Gaussian source measure p0(z) = N (0, I). (Middle) The FM objective defines a vector field vt(z) that drives the transport. The resulting ODE flow dz/dt = vt(z) pushes the probabilit… view at source ↗
Figure 6
Figure 6. Figure 6: FM with the Brownian-Bridge-Like Target for Sequence Prediction. This figure illustrates how FM with the proposed target probability path in [70] induces KDE-to￾KDE transport, allowing us to obtain the distribution for the state in the next time point xτ+1, given a distribution centered around the current point xτ . Top Panel: Sequence data points (. . . , xτ , xτ+1, . . .), indexed by τ , on a curve. The … view at source ↗
Figure 7
Figure 7. Figure 7: Computational Cost. (a) FLOPs Comparison during training phase. (b) FLOPs Comparison during inference phase. All the FLOPs are computed among 6240 samples [PITH_FULL_IMAGE:figures/full_fig_p048_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: H´enon–Heiles System. generated trajectories. In this ablation study, we examine different integrator configurations, including the number of integration steps and the choice of ODE solvers, to investigate whether our model behaves consistently with standard flow matching models across various settings. We conduct the ablation study under the same setting as Sec. 4.3, except that we vary the number of ODE … view at source ↗
Figure 9
Figure 9. Figure 9: Sprott G System [PITH_FULL_IMAGE:figures/full_fig_p049_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Isothermal Chemical Process. 49 [PITH_FULL_IMAGE:figures/full_fig_p049_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Jerk Circuit Oscillator [PITH_FULL_IMAGE:figures/full_fig_p050_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Forced Brusselator System. 50 [PITH_FULL_IMAGE:figures/full_fig_p050_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: Ablation Study. (a) Ablation study for ODE integration steps: [30, 50, 100]. (b) Ablation study for 3 different ODE solver: [Euler, Runge–Kutta,Exponential Euler]. (c) Ablation study for truncation model, truncation number R = 256. Shaded regions from Fig (a)-(c) indicate ±0.5 standard error over 135 dynamical systems, each with 20 trajectories originating from randomly sampled initial conditions. 51 [PI… view at source ↗
read the original abstract

Flow matching (FM) is increasingly used in scientific domains for time series generation and forecasting, where data often arise from underlying dynamical systems. However, it is not well-understood whether it learns transferable dynamical structure or simply performs an effective "trajectory replay". We study this question by deriving the velocity field targeted by the empirical FM objective on sequential data in the limit of perfect function approximation. For the Gaussian conditional paths commonly used in practice, we show that the implied sampler is an ODE whose dynamics constitutes a nonparametric, memory-augmented continuous-time dynamical system. The optimal field admits a closed-form expression as a similarity-weighted mixture of instantaneous velocities induced by observed transitions, making the dataset dependence explicit and interpretable. This characterization positions neural FM models as parametric surrogates of an ideal nonparametric solution and suggests practical approximation schemes for robust ODE-based generation. As a byproduct of our analysis, the resulting closed-form sampler, FreeFM, provides strong probabilistic forecasts on nonlinear dynamical system benchmarks directly from historical transitions, without training.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper derives the velocity field targeted by the empirical flow-matching objective on sequential data under perfect function approximation and Gaussian conditional paths. It shows that the implied ODE sampler is a nonparametric, memory-augmented continuous-time dynamical system whose optimal field is a closed-form similarity-weighted mixture of instantaneous velocities from observed transitions. This positions neural FM models as parametric surrogates of an ideal nonparametric solution and introduces the training-free FreeFM sampler, which is claimed to yield strong probabilistic forecasts on nonlinear dynamical system benchmarks directly from historical transitions.

Significance. If the derivation holds, the result supplies an explicit, interpretable characterization of flow matching for time-series data that connects it to dynamical systems theory and supplies a nonparametric baseline. The closed-form expression and the FreeFM byproduct are concrete strengths that could guide both theoretical understanding and practical approximation schemes for ODE-based generation on sequential data.

major comments (3)
  1. [§3] §3 (derivation of optimal field): the central claim that the velocity field equals the similarity-weighted mixture holds only in the limit of perfect function approximation. No approximation-error bounds or capacity analysis are supplied for finite neural networks, which is load-bearing for the claim that neural FM models act as parametric surrogates of the nonparametric solution.
  2. [§4] §4 (FreeFM benchmarks): the assertion of 'strong probabilistic forecasts' on nonlinear dynamical system benchmarks is presented without reported quantitative metrics, baseline comparisons, or error analysis. This weakens the empirical support for the practical utility of the closed-form sampler.
  3. [§2] Assumption of Gaussian conditional paths (stated in abstract and §2): the closed-form characterization is derived specifically for this choice. No discussion or counter-example is given for non-Gaussian paths that are sometimes used in practice, leaving open whether the memory-augmented interpretation generalizes.
minor comments (2)
  1. [§3] Notation for the similarity kernel and weighting function is introduced without an explicit equation reference in the main text; adding a numbered display equation would improve clarity.
  2. [§4] The abstract states that FreeFM 'provides strong probabilistic forecasts... without training,' yet the experimental section does not include a direct comparison table against standard FM or other baselines; a compact results table would strengthen the presentation.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive review and positive assessment of the paper's potential contribution. We address each major comment point by point below and describe the planned revisions.

read point-by-point responses
  1. Referee: [§3] §3 (derivation of optimal field): the central claim that the velocity field equals the similarity-weighted mixture holds only in the limit of perfect function approximation. No approximation-error bounds or capacity analysis are supplied for finite neural networks, which is load-bearing for the claim that neural FM models act as parametric surrogates of the nonparametric solution.

    Authors: We agree that the derivation in §3 is performed under the assumption of perfect function approximation, as stated at the beginning of that section. This limit yields the explicit closed-form target that neural networks are trained to match. No explicit error bounds for finite-capacity networks are derived. In the revision we will expand the discussion in §3 to clarify that neural FM models act as parametric surrogates to this ideal nonparametric field, reference relevant neural ODE approximation results, and note the lack of finite-sample bounds as a limitation and direction for future work. revision: partial

  2. Referee: [§4] §4 (FreeFM benchmarks): the assertion of 'strong probabilistic forecasts' on nonlinear dynamical system benchmarks is presented without reported quantitative metrics, baseline comparisons, or error analysis. This weakens the empirical support for the practical utility of the closed-form sampler.

    Authors: We accept that the current presentation of FreeFM results relies on qualitative description. To strengthen the empirical section, the revised manuscript will include a table of quantitative metrics (MSE, CRPS) on the nonlinear dynamical system benchmarks, direct comparisons against standard baselines (linear AR, RNN, trained flow matching), and error bars obtained from multiple random seeds. revision: yes

  3. Referee: [§2] Assumption of Gaussian conditional paths (stated in abstract and §2): the closed-form characterization is derived specifically for this choice. No discussion or counter-example is given for non-Gaussian paths that are sometimes used in practice, leaving open whether the memory-augmented interpretation generalizes.

    Authors: The derivation relies on the Gaussian property to obtain the closed-form similarity-weighted mixture. We will revise §2 to state explicitly that the memory-augmented dynamical-system interpretation is tied to the Gaussian conditional paths commonly used in flow matching, and to note that non-Gaussian paths generally do not admit the same closed-form expression, leaving generalization as an open question. revision: yes

Circularity Check

0 steps flagged

Derivation from empirical FM objective to closed-form mixture is direct and non-circular

full rationale

The paper starts from the standard empirical flow-matching objective and, under the stated limit of perfect function approximation with Gaussian conditional paths, algebraically obtains the optimal velocity field as a similarity-weighted mixture of observed transition velocities. This is a mathematical characterization of the nonparametric solution implied by the objective itself, not a redefinition, a fitted parameter renamed as prediction, or a result justified only by self-citation. No load-bearing step reduces to its own inputs by construction, and the central claim remains independent of any prior author work invoked for uniqueness or ansatz.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on the mathematical limit of perfect function approximation and the standard choice of Gaussian conditional paths; these are domain assumptions rather than new postulates.

axioms (2)
  • domain assumption limit of perfect function approximation
    The velocity-field derivation is performed under the assumption that the model can exactly realize the target field.
  • domain assumption Gaussian conditional paths are used
    The closed-form mixture expression is derived specifically for the Gaussian paths commonly employed in flow matching.

pith-pipeline@v0.9.0 · 5483 in / 1399 out tokens · 59130 ms · 2026-05-16T06:19:36.979441+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Sharpen Your Flow: Sharpness-Aware Sampling for Flow Matching

    cs.LG 2026-05 unverdicted novelty 5.0

    SharpEuler estimates a sharpness profile via finite differences on calibration trajectories, smooths it, and applies a quantile transform to generate adaptive timestep grids that improve Euler sampling quality in flow...

Reference graph

Works this paper leans on

110 extracted references · 110 canonical work pages · cited by 1 Pith paper · 11 internal anchors

  1. [1]

    Stochastic Interpolants: A Unifying Framework for Flows and Diffusions

    Michael S Albergo, Nicholas M Boffi, and Eric Vanden-Eijnden. Stochastic interpolants: A unifying framework for flows and diffusions.arXiv preprint arXiv:2303.08797, 2023

  2. [2]

    Learning to sample better.Journal of Statistical Mechanics: Theory and Experiment, 2024(10):104014, 2024

    Michael S Albergo and Eric Vanden-Eijnden. Learning to sample better.Journal of Statistical Mechanics: Theory and Experiment, 2024(10):104014, 2024

  3. [3]

    Chronos-2: From Univariate to Universal Forecasting

    Abdul Fatir Ansari, Oleksandr Shchur, Jaris K¨ uken, Andreas Auer, Boran Han, Pedro Mercado, Syama Sundar Rangapuram, Huibin Shen, Lorenzo Stella, Xiyuan Zhang, et al. Chronos-2: From univariate to universal forecasting.arXiv preprint arXiv:2510.15821, 2025

  4. [4]

    TiRex: Zero-shot forecasting across long and short horizons with en- hanced in-context learning.arXiv preprint arXiv:2505.23719, 2025

    Andreas Auer, Patrick Podest, Daniel Klotz, Sebastian B¨ ock, G¨ unter Klambauer, and Sepp Hochreiter. TiRex: Zero-shot forecasting across long and short horizons with en- hanced in-context learning.arXiv preprint arXiv:2505.23719, 2025

  5. [5]

    doi:10.48550/arXiv.2003.02236

    O. Azencot, N. B. Erichson, V. Lin, and M. W. Mahoney. Forecasting sequential data using Consistent Koopman Autoencoders. Technical Report Preprint: arXiv:2003.02236, 2020

  6. [6]

    Springer Science & Business Media, 2013

    Dominique Bakry, Ivan Gentil, and Michel Ledoux.Analysis and Geometry of Markov Diffusion Operators, volume 348. Springer Science & Business Media, 2013

  7. [7]

    Statistical Models: Theory and Practice, 2006

    Karin Bammann. Statistical Models: Theory and Practice, 2006

  8. [8]

    SDE Matching: Scalable and simulation-free training of latent stochastic differential equations.arXiv preprint arXiv:2502.02472, 2025

    Grigory Bartosh, Dmitry Vetrov, and Christian A Naesseth. SDE Matching: Scalable and simulation-free training of latent stochastic differential equations.arXiv preprint arXiv:2502.02472, 2025

  9. [9]

    J. A. L. Benitez, J. Guo, K. Hegazy, I. Dokmanic, M. W. Mahoney, and M. V. de Hoop. Neural equilibria for long-term prediction of nonlinear conservation laws. Technical Report Preprint: arXiv:2501.06933, 2025

  10. [10]

    PhD thesis, Universit´ e Pierre et Marie Curie-Paris VI, 2012

    Amel Bentata.Markovian Projection of Stochastic Processes. PhD thesis, Universit´ e Pierre et Marie Curie-Paris VI, 2012

  11. [11]

    Error bounds for flow matching methods.arXiv preprint arXiv:2305.16860, 2023

    Joe Benton, George Deligiannidis, and Arnaud Doucet. Error bounds for flow matching methods.arXiv preprint arXiv:2305.16860, 2023

  12. [12]

    Limits of learning dynamical systems.SIAM Review, 67(1):107–137, 2025

    Tyrus Berry and Suddhasattwa Das. Limits of learning dynamical systems.SIAM Review, 67(1):107–137, 2025

  13. [13]

    Nonparametric forecasting of low- dimensional dynamical systems.Physical Review E, 91(3):032915, 2015

    Tyrus Berry, Dimitrios Giannakis, and John Harlim. Nonparametric forecasting of low- dimensional dynamical systems.Physical Review E, 91(3):032915, 2015

  14. [14]

    On the closed- form of flow matching: Generalization does not arise from target stochasticity.arXiv preprint arXiv:2506.03719, 2025

    Quentin Bertrand, Anne Gagneux, Mathurin Massias, and R´ emi Emonet. On the closed- form of flow matching: Generalization does not arise from target stochasticity.arXiv preprint arXiv:2506.03719, 2025

  15. [15]

    Modeling temporal data as continuous functions with stochastic process diffusion, 2023

    Marin Biloˇ s, Kashif Rasul, Anderson Schneider, Yuriy Nevmyvaka, and Stephan G¨ unnemann. Modeling temporal data as continuous functions with stochastic process diffusion, 2023

  16. [16]

    Springer Science & Business Media, 2012

    Denis Bosq.Nonparametric Statistics for Stochastic Processes: Estimation and Prediction, volume 110. Springer Science & Business Media, 2012

  17. [17]

    Mimicking an Itˆ o process by a solution of a stochastic differential equation.The Annals of Applied Probability, 23(4):1584 – 1628, 2013

    Gerard Brunick and Steven Shreve. Mimicking an Itˆ o process by a solution of a stochastic differential equation.The Annals of Applied Probability, 23(4):1584 – 1628, 2013. 13

  18. [18]

    Modern Koopman theory for dynamical systems.arXiv preprint arXiv:2102.12086, 2021

    Steven L Brunton, Marko Budiˇ si´ c, Eurika Kaiser, and J Nathan Kutz. Modern Koopman theory for dynamical systems.arXiv preprint arXiv:2102.12086, 2021

  19. [19]

    Springer Science & Business Media, 2013

    Kung-Sik Chan and Howell Tong.Chaos: A Statistical Perspective. Springer Science & Business Media, 2013

  20. [20]

    Neural ordinary differential equations.Advances in Neural Information Processing Systems, 31, 2018

    Ricky TQ Chen, Yulia Rubanova, Jesse Bettencourt, and David K Duvenaud. Neural ordinary differential equations.Advances in Neural Information Processing Systems, 31, 2018

  21. [21]

    Probabilistic forecasting with stochastic interpolants and f\” ollmer pro- PATH INFERENCE UNDER PARTIAL OBSERVATIONS 23 cesses.arXiv preprint arXiv:2403.13724,

    Yifan Chen, Mark Goldstein, Mengjian Hua, Michael S Albergo, Nicholas M Boffi, and Eric Vanden-Eijnden. Probabilistic forecasting with stochastic interpolants and F¨ ollmer processes.arXiv preprint arXiv:2403.13724, 2024

  22. [22]

    Diffusion maps.Applied and Computational Harmonic Analysis, 21(1):5–30, 2006

    Ronald R Coifman and St´ ephane Lafon. Diffusion maps.Applied and Computational Harmonic Analysis, 21(1):5–30, 2006

  23. [23]

    Long-term forecasting with TiDe: Time-series dense encoder, 2024

    Abhimanyu Das, Weihao Kong, Andrew Leach, Shaan Mathur, Rajat Sen, and Rose Yu. Long-term forecasting with TiDe: Time-series dense encoder, 2024

  24. [24]

    Coarse graining of stochastic differential equations: averaging and projection method.arXiv preprint arXiv:2506.14939, 2025

    Manh Hong Duong, Carsten Hartmann, and Michela Ottobre. Coarse graining of stochastic differential equations: averaging and projection method.arXiv preprint arXiv:2506.14939, 2025

  25. [25]

    Pricing with a smile.Risk, 7(1):18–20, 1994

    Bruno Dupire et al. Pricing with a smile.Risk, 7(1):18–20, 1994

  26. [26]

    Cambridge University Press, 5 edition, 2019

    Rick Durrett.Probability: Theory and Examples. Cambridge University Press, 5 edition, 2019

  27. [27]

    Springer, 2003

    Jianqing Fan and Qiwei Yao.Nonlinear Time Series: Nonparametric and Parametric Methods. Springer, 2003

  28. [28]

    Chaos as an interpretable benchmark for forecasting and data-driven modelling

    William Gilpin. Chaos as an interpretable benchmark for forecasting and data-driven modelling. InThirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 2), 2021

  29. [29]

    Chaos as an interpretable benchmark for forecasting and data-driven modelling, 2023

    William Gilpin. Chaos as an interpretable benchmark for forecasting and data-driven modelling, 2023

  30. [30]

    Model scale versus domain knowledge in statistical forecasting of chaotic systems, 2023

    William Gilpin. Model scale versus domain knowledge in statistical forecasting of chaotic systems, 2023

  31. [31]

    Generative learning for nonlinear dynamics.Nature Reviews Physics, 6(3):194–206, 2024

    William Gilpin. Generative learning for nonlinear dynamics.Nature Reviews Physics, 6(3):194–206, 2024

  32. [32]

    Stable generative modelling using Schr¨ odinger bridges.Philosophical Transactions A, 383(2299):20240332, 2025

    Georg A Gottwald, Fengyi Li, Youssef Marzouk, and Sebastian Reich. Stable generative modelling using Schr¨ odinger bridges.Philosophical Transactions A, 383(2299):20240332, 2025

  33. [33]

    Measuring the strangeness of strange attractors

    Peter Grassberger and Itamar Procaccia. Measuring the strangeness of strange attractors. Physica D: Nonlinear Phenomena, 9(1-2):189–208, 1983

  34. [34]

    Large language models are zero-shot time series forecasters.Advances in Neural Information Processing Systems, 36:19622–19635, 2023

    Nate Gruver, Marc Finzi, Shikai Qiu, and Andrew G Wilson. Large language models are zero-shot time series forecasters.Advances in Neural Information Processing Systems, 36:19622–19635, 2023

  35. [35]

    Efficiently Modeling Long Sequences with Structured State Spaces

    Albert Gu, Karan Goel, and Christopher R´ e. Efficiently modeling long sequences with structured state spaces.arXiv preprint arXiv:2111.00396, 2021. 14

  36. [36]

    Mimicking the one-dimensional marginal distributions of processes having an Itˆ o differential.Probability Theory and Related Fields, 71(4):501–516, 1986

    Istv´ an Gy¨ ongy. Mimicking the one-dimensional marginal distributions of processes having an Itˆ o differential.Probability Theory and Related Fields, 71(4):501–516, 1986

  37. [37]

    Out-of-domain generalization in dynamical systems reconstruction, 2024

    Niclas G¨ oring, Florian Hess, Manuel Brenner, Zahra Monfared, and Daniel Durstewitz. Out-of-domain generalization in dynamical systems reconstruction, 2024

  38. [38]

    Kernel density estimation for dynamical systems.Journal of Machine Learning Research, 19(35):1–49, 2018

    Hanyuan Hang, Ingo Steinwart, Yunlong Feng, and Johan AK Suykens. Kernel density estimation for dynamical systems.Journal of Machine Learning Research, 19(35):1–49, 2018

  39. [39]

    Cambridge University Press, 2018

    John Harlim.Data-Driven Computational Methods: Parameter and Operator Estimations. Cambridge University Press, 2018

  40. [40]

    SIAM, 2002

    Philip Hartman.Ordinary Differential Equations. SIAM, 2002

  41. [41]

    Kernel density estimation for time series data

    Andrew Harvey and Vitaliy Oryshchenko. Kernel density estimation for time series data. International Journal of Forecasting, 28(1):3–14, 2012

  42. [42]

    Generalized teacher forcing for learning chaotic dynamics, 2023

    Florian Hess, Zahra Monfared, Manuel Brenner, and Daniel Durstewitz. Generalized teacher forcing for learning chaotic dynamics, 2023

  43. [43]

    Long short-term memory.Neural Computation, 9(8):1735–1780, 1997

    Sepp Hochreiter and J¨ urgen Schmidhuber. Long short-term memory.Neural Computation, 9(8):1735–1780, 1997

  44. [44]

    Neurons with graded response have collective computational proper- ties like those of two-state neurons.Proceedings of the National Academy of Sciences, 81(10):3088–3092, 1984

    John J Hopfield. Neurons with graded response have collective computational proper- ties like those of two-state neurons.Proceedings of the National Academy of Sciences, 81(10):3088–3092, 1984

  45. [45]

    Li, Sheng Wang, Jiheng Zhang, Ziyun Li, and Tianlong Chen

    Yang Hu, Xiao Wang, Zezhen Ding, Lirong Wu, Huatian Zhang, Stan Z. Li, Sheng Wang, Jiheng Zhang, Ziyun Li, and Tianlong Chen. FlowTS: Time series generation via rectified flow, 2025

  46. [46]

    Embedding nonlinear dynamical systems: A guide to Takens’ theorem

    Jeremy P Huke. Embedding nonlinear dynamical systems: A guide to Takens’ theorem. 2006

  47. [47]

    Deterministic and random dynamical systems: theory and numerics

    AR Humphries and AM Stuart. Deterministic and random dynamical systems: theory and numerics. InModern Methods in Scientific Computing and Applications, pages 211–254. Springer, 2002

  48. [48]

    Longitudinal flow matching for trajectory modeling.arXiv preprint arXiv:2510.03569, 2025

    Mohammad Mohaiminul Islam, Thijs P Kuipers, Sharvaree Vadgama, Coen de Vente, Afsana Khan, Clara I S´ anchez, and Erik J Bekkers. Longitudinal flow matching for trajectory modeling.arXiv preprint arXiv:2510.03569, 2025

  49. [49]

    echo state

    Herbert Jaeger. The “echo state” approach to analysing and training recurrent neural networks-with an erratum note.Bonn, Germany: German national research center for information technology gmd technical report, 148(34):13, 2001

  50. [50]

    Harnessing nonlinearity: Predicting chaotic systems and saving energy in wireless communication.Science, 304(5667):78–80, 2004

    Herbert Jaeger and Harald Haas. Harnessing nonlinearity: Predicting chaotic systems and saving energy in wireless communication.Science, 304(5667):78–80, 2004

  51. [51]

    Trajectory generator matching for time series.arXiv preprint arXiv:2505.23215, 2025

    T Jahn, J Chemseddine, P Hagemann, C Wald, and G Steidl. Trajectory generator matching for time series.arXiv preprint arXiv:2505.23215, 2025

  52. [52]

    Time-llm: Time series forecasting by reprogramming large language models,

    Ming Jin, Shiyu Wang, Lintao Ma, Zhixuan Chu, James Y Zhang, Xiaoming Shi, Pin-Yu Chen, Yuxuan Liang, Yuan-Fang Li, Shirui Pan, et al. Time-LLM: Time series forecasting by reprogramming large language models.arXiv preprint arXiv:2310.01728, 2023

  53. [53]

    Diffusion geometry.arXiv preprint arXiv:2405.10858, 2024

    Iolo Jones. Diffusion geometry.arXiv preprint arXiv:2405.10858, 2024

  54. [54]

    On neural differential equations.arXiv preprint arXiv:2202.02435,

    Patrick Kidger. On neural differential equations.arXiv preprint arXiv:2202.02435, 2022. 15

  55. [55]

    The asymmetry of the sAPE measure and other comments on the M3-competition.International Journal of Forecasting, 17(4):570–574, 2001

    Anne B Koehler. The asymmetry of the sAPE measure and other comments on the M3-competition.International Journal of Forecasting, 17(4):570–574, 2001

  56. [56]

    Predict, refine, synthesize: Self-guiding diffusion mod- els for probabilistic time series forecasting.Advances in Neural Information Processing Systems, 36, 2024

    Marcel Kollovieh, Abdul Fatir Ansari, Michael Bohlke-Schneider, Jasper Zschiegner, Hao Wang, and Yuyang Bernie Wang. Predict, refine, synthesize: Self-guiding diffusion mod- els for probabilistic time series forecasting.Advances in Neural Information Processing Systems, 36, 2024

  57. [57]

    Flow matching with Gaussian process priors for probabilistic time series forecasting.arXiv preprint arXiv:2410.03024, 2024

    Marcel Kollovieh, Marten Lienen, David L¨ udke, Leo Schwinn, and Stephan G¨ unnemann. Flow matching with Gaussian process priors for probabilistic time series forecasting.arXiv preprint arXiv:2410.03024, 2024

  58. [58]

    Neural operator: Learning maps between function spaces with applications to PDEs.Journal of Machine Learning Re- search, 24(89):1–97, 2023

    Nikola Kovachki, Zongyi Li, Burigede Liu, Kamyar Azizzadenesheli, Kaushik Bhat- tacharya, Andrew Stuart, and Anima Anandkumar. Neural operator: Learning maps between function spaces with applications to PDEs.Journal of Machine Learning Re- search, 24(89):1–97, 2023

  59. [59]

    Learning continuous models for continuous physics.Communications Physics, 6(1):319, 2023

    Aditi S Krishnapriyan, Alejandro F Queiruga, N Benjamin Erichson, and Michael W Mahoney. Learning continuous models for continuous physics.Communications Physics, 6(1):319, 2023

  60. [60]

    On information and sufficiency.The Annals of Mathematical Statistics, 22(1):79–86, 1951

    Solomon Kullback and Richard A Leibler. On information and sufficiency.The Annals of Mathematical Statistics, 22(1):79–86, 1951

  61. [61]

    On the minimax optimality of flow matching through the connection to kernel density estimation.arXiv preprint arXiv:2504.13336, 2025

    Lea Kunkel and Mathias Trabs. On the minimax optimality of flow matching through the connection to kernel density estimation.arXiv preprint arXiv:2504.13336, 2025

  62. [62]

    Nonparametric estimation of the stationary density and the transition density of a Markov chain.Stochastic Processes and their Applications, 118(2):232–260, 2008

    Claire Lacour. Nonparametric estimation of the stationary density and the transition density of a Markov chain.Stochastic Processes and their Applications, 118(2):232–260, 2008

  63. [63]

    Yale University, 2004

    St´ ephane S Lafon.Diffusion Maps and Geometric Harmonics. Yale University, 2004

  64. [64]

    The principles of diffusion models.arXiv preprint arXiv:2510.21890, 2025

    Chieh-Hsin Lai, Yang Song, Dongjun Kim, Yuki Mitsufuji, and Stefano Ermon. The principles of diffusion models.arXiv preprint arXiv:2510.21890, 2025

  65. [65]

    Multi-marginal stochastic flow matching for high-dimensional snapshot data at irregular time points.arXiv preprint arXiv:2508.04351, 2025

    Justin Lee, Behnaz Moradijamei, and Heman Shakeri. Multi-marginal stochastic flow matching for high-dimensional snapshot data at irregular time points.arXiv preprint arXiv:2508.04351, 2025

  66. [66]

    A survey of the schr \" odinger problem and some of its connections with optimal transport.arXiv preprint arXiv:1308.0215,

    Christian L´ eonard. A survey of the Schr¨ odinger problem and some of its connections with optimal transport.arXiv preprint arXiv:1308.0215, 2013

  67. [67]

    From Fourier to neural ODEs: Flow matching for modeling complex systems.arXiv preprint arXiv:2405.11542, 2024

    Xin Li, Jingdong Zhang, Qunxi Zhu, Chengli Zhao, Xue Zhang, Xiaojun Duan, and Wei Lin. From Fourier to neural ODEs: Flow matching for modeling complex systems.arXiv preprint arXiv:2405.11542, 2024

  68. [68]

    Markov neural operators for learning chaotic systems.arXiv preprint arXiv:2106.06898, pages 2–3, 2021

    Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhat- tacharya, Andrew Stuart, and Anima Anandkumar. Markov neural operators for learning chaotic systems.arXiv preprint arXiv:2106.06898, pages 2–3, 2021

  69. [69]

    On The Hidden Biases of Flow Matching Samplers

    Soon Hoe Lim. On the hidden biases of flow matching samplers.arXiv preprint arXiv:2512.16768, 2025

  70. [70]

    Elucidating the design choice of probability paths in flow matching for forecasting.Transaction on Machine Learning Research, 2025

    Soon Hoe Lim, Yijin Wang, Annan Yu, Emma Hart, Michael W Mahoney, Xiaoye S Li, and N Benjamin Erichson. Elucidating the design choice of probability paths in flow matching for forecasting.Transaction on Machine Learning Research, 2025. 16

  71. [71]

    Flow Matching for Generative Modeling

    Yaron Lipman, Ricky TQ Chen, Heli Ben-Hamu, Maximilian Nickel, and Matt Le. Flow matching for generative modeling.arXiv preprint arXiv:2210.02747, 2022

  72. [72]

    Flow Matching Guide and Code

    Yaron Lipman, Marton Havasi, Peter Holderrieth, Neta Shaul, Matt Le, Brian Karrer, Ricky TQ Chen, David Lopez-Paz, Heli Ben-Hamu, and Itai Gat. Flow matching guide and code.arXiv preprint arXiv:2412.06264, 2024

  73. [73]

    From Navigation to Refinement: Revealing the Two-Stage Nature of Flow-based Diffusion Models through Oracle Velocity

    Haoming Liu, Jinnuo Liu, Yanhao Li, Liuyang Bai, Yunkai Ji, Yuanhe Guo, Shenji Wan, and Hongyi Wen. From navigation to refinement: Revealing the two-stage nature of flow- based diffusion models through oracle velocity.arXiv preprint arXiv:2512.02826, 2025

  74. [74]

    Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow

    Xingchao Liu, Chengyue Gong, and Qiang Liu. Flow straight and fast: Learning to generate and transfer data with rectified flow.arXiv preprint arXiv:2209.03003, 2022

  75. [75]

    Sundial: A Family of Highly Capable Time Series Foundation Models

    Yong Liu, Guo Qin, Zhiyuan Shi, Zhi Chen, Caiyin Yang, Xiangdong Huang, Jianmin Wang, and Mingsheng Long. Sundial: A family of highly capable time series foundation models.arXiv preprint arXiv:2502.00816, 2025

  76. [76]

    DeepONet: Learning nonlinear operators for identifying differential equations based on the universal approximation theorem of operators

    Lu Lu, Pengzhan Jin, and George Em Karniadakis. DeepOnet: Learning nonlinear oper- ators for identifying differential equations based on the universal approximation theorem of operators.arXiv preprint arXiv:1910.03193, 2019

  77. [77]

    Lets forecast: Learning embedology for time series forecasting.arXiv preprint arXiv:2506.06454, 2025

    Abrar Majeedi, Viswanatha Reddy Gajjala, Satya Sai Srinath Namburi GNVV, Nada Magdi Elkordi, and Yin Li. Lets forecast: Learning embedology for time series forecasting.arXiv preprint arXiv:2506.06454, 2025

  78. [78]

    Distribution learning via neural differential equations: a nonparametric statistical perspective.Journal of Machine Learning Research, 25(232):1–61, 2024

    Youssef Marzouk, Zhi Robert Ren, Sven Wang, and Jakob Zech. Distribution learning via neural differential equations: a nonparametric statistical perspective.Journal of Machine Learning Research, 25(232):1–61, 2024

  79. [79]

    Scoring rules for continuous probability dis- tributions.Management Science, 22(10):1087–1096, 1976

    James E Matheson and Robert L Winkler. Scoring rules for continuous probability dis- tributions.Management Science, 22(10):1087–1096, 1976

  80. [80]

    Statistical inference for dynamical systems: A review

    Kevin McGoff, Sayan Mukherjee, and Natesh Pillai. Statistical inference for dynamical systems: A review. 2015

Showing first 80 references.