Pathwise Learning of Stochastic Dynamical Systems with Partial Observations
Pith reviewed 2026-05-16 09:36 UTC · model grok-4.3
The pith
A neural variational control method learns an SDE that induces the filtering path measure from noisy partial observations.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We first derive a stochastic control problem that solves the filtering posterior path measure corresponding to a pathwise Zakai equation. We then construct a generative model that maps the prior path measure to the posterior measure through the controlled diffusion and the associated Radon-Nikodym derivative. Through an amortization of sample paths of the observation process, the control is learned through the noisy observation paths and we learn an associated SDE which induces the filtering path measure. The approach is demonstrated on various nonlinear stochastic systems, showcasing its ability to handle multimodal data distributions, chaotic dynamics, and sparse observation data.
What carries the argument
The generative model that uses a controlled diffusion and its Radon-Nikodym derivative to transform the prior path measure into the posterior filtering measure, with the control learned by amortization over observation paths.
If this is right
- The learned SDE generates trajectories that match the filtering posterior for new observation paths.
- The method simultaneously approximates SDE coefficients and infers posterior updates from indirect data.
- It accommodates multimodal distributions and chaotic dynamics without requiring full-state training data.
- Performance holds for sparse and nonlinear observation schedules.
Where Pith is reading between the lines
- The amortization step could support efficient reuse across large batches of observation sequences.
- The same control construction might be adapted to learn unknown parameters in the observation model jointly with the dynamics.
- Pathwise learning may preserve temporal structure better than snapshot-based filtering in long trajectories.
Load-bearing premise
The variational approximation via the controlled diffusion and Radon-Nikodym derivative recovers the true posterior path measure accurately even for multimodal or chaotic systems, with negligible error from the neural parameterization.
What would settle it
Simulate data from a known chaotic system such as the stochastic Lorenz equations under partial noisy observations, run the learned SDE, and check whether its generated path statistics match those produced by an exact particle filter within sampling error.
Figures
read the original abstract
The reconstruction and inference of stochastic dynamical systems from data is a fundamental task in inverse problems and statistical learning. While surrogate modeling advances computational methods to approximate these dynamics, standard approaches typically require high-fidelity training data. In many practical settings, the data are indirectly observed through noisy and nonlinear measurement. The challenge lies not only in approximating the coefficients of the SDEs, but in simultaneously inferring the posterior updates given the observations. In this work, we present a neural path estimation approach to solve stochastic dynamical systems based on variational inference. We first derive a stochastic control problem that solve filtering posterior path measure corresponding to a pathwise Zakai equation. We then construct a generative model that maps the prior path measure to posterior measure through the controlled diffusion and the associated Randon-Nykodym derivative. Through an amortization of sample paths of the observation process, the control is learned through the noisy observation paths and we learn an associated SDE which induces the filtering path measure. In the end, we demonstrate the model's performance on various nonlinear stochastic systems, showcasing its ability to handle multimodal data distributions, chaotic dynamics, and sparse observation data.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims to introduce a neural path estimation method for learning stochastic dynamical systems from partial noisy observations. It derives a stochastic control problem from the pathwise Zakai equation whose solution yields the filtering posterior path measure, constructs a generative model via controlled diffusion and Radon-Nikodym derivative to map prior to posterior, and amortizes the control over sample observation paths to learn an SDE inducing the filtering measure. The approach is demonstrated on nonlinear systems handling multimodal distributions, chaotic dynamics, and sparse data.
Significance. If the variational scheme is shown to recover accurate posteriors with quantitative validation, the work could provide a useful amortized variational inference tool for inverse problems involving SDEs under partial observations, extending pathwise filtering ideas to neural control settings.
major comments (3)
- [Abstract and numerical experiments] Abstract and numerical experiments section: The manuscript asserts performance on multimodal, chaotic, and sparsely observed nonlinear systems, yet supplies no quantitative error metrics, convergence rates, baseline comparisons, or validation details. This absence is load-bearing for the central claim that the amortized control recovers the filtering path measure effectively.
- [Derivation] Derivation section (pathwise Zakai to control problem): The transition from the pathwise Zakai equation to the stochastic control formulation is described at a high level without explicit equations showing how the control process is chosen so that the induced measure matches the posterior; the Radon-Nikodym term and its neural parameterization require a concrete statement of the objective functional.
- [Amortization step] Amortization and learning step: The claim that amortization over observation paths yields a control whose induced SDE approximates the true posterior relies on the unverified assumption that the neural parameterization incurs negligible error for multimodal or chaotic dynamics; no error bounds or diagnostic checks are provided to support this.
minor comments (3)
- [Abstract] Abstract: 'Randon-Nykodym' is misspelled and should read 'Radon-Nikodym'.
- [Abstract] Abstract: The phrase 'a stochastic control problem that solve filtering posterior path measure' contains a grammatical error and should be rephrased for clarity.
- [Generative model] Notation: The distinction between the controlled diffusion process and the induced path measure should be stated more explicitly to avoid ambiguity in the generative model description.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive comments on our manuscript. We address each major comment below and have revised the manuscript to incorporate additional quantitative validation, explicit derivations, and empirical diagnostics where appropriate.
read point-by-point responses
-
Referee: [Abstract and numerical experiments] Abstract and numerical experiments section: The manuscript asserts performance on multimodal, chaotic, and sparsely observed nonlinear systems, yet supplies no quantitative error metrics, convergence rates, baseline comparisons, or validation details. This absence is load-bearing for the central claim that the amortized control recovers the filtering path measure effectively.
Authors: We agree that quantitative metrics are necessary to support the central claims. In the revised manuscript we have added L2 pathwise estimation errors against ground-truth trajectories, KL divergence estimates between the learned and reference posterior measures (computed via long-run Monte Carlo), and direct comparisons against particle-filter baselines on the same multimodal and chaotic examples. Convergence behavior of the variational objective with respect to network width and training iterations is now shown in the supplementary material. revision: yes
-
Referee: [Derivation] Derivation section (pathwise Zakai to control problem): The transition from the pathwise Zakai equation to the stochastic control formulation is described at a high level without explicit equations showing how the control process is chosen so that the induced measure matches the posterior; the Radon-Nikodym term and its neural parameterization require a concrete statement of the objective functional.
Authors: We have expanded the derivation section with the missing explicit steps. Starting from the pathwise Zakai equation, we apply Girsanov’s theorem to obtain the controlled diffusion whose law is absolutely continuous with respect to the prior; the Radon–Nikodym derivative is written explicitly as exp(∫ u·dW − ½∫|u|² dt). The resulting objective functional is the expectation of ½∫|u_t|² dt minus the integrated observation likelihood term, which is minimized by the neural parameterization of u. The revised text now states this functional and the optimality condition that equates the controlled measure to the filtering posterior. revision: yes
-
Referee: [Amortization step] Amortization and learning step: The claim that amortization over observation paths yields a control whose induced SDE approximates the true posterior relies on the unverified assumption that the neural parameterization incurs negligible error for multimodal or chaotic dynamics; no error bounds or diagnostic checks are provided to support this.
Authors: We acknowledge that rigorous a-priori error bounds for the neural control in multimodal or chaotic regimes are not currently available. In the revision we have added empirical diagnostics: (i) variance across ten independent training runs with different seeds, (ii) side-by-side posterior sample histograms against reference particle-filter solutions, and (iii) a brief discussion invoking the universal approximation property of the chosen network class. These checks indicate that the amortization error remains small on the tested systems; a full theoretical analysis is noted as future work. revision: partial
Circularity Check
Derivation chain is self-contained with no circular reductions
full rationale
The paper first derives a stochastic control problem from the pathwise Zakai equation to recover the filtering posterior path measure, then constructs a generative model using controlled diffusion and the Radon-Nikodym derivative, and finally amortizes over observation paths to learn the control. This sequence follows standard variational inference and stochastic filtering techniques without reducing any claimed prediction or result to its own fitted inputs by construction. No self-citations are invoked as load-bearing uniqueness theorems, no ansatzes are smuggled via prior work, and no known empirical patterns are merely renamed. The central claims remain independent of the target outputs.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We first derive a stochastic control problem that solve filtering posterior path measure corresponding to a pathwise Zakai equation... controlled diffusion and the associated Radon-Nikodym derivative.
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The optimal control is achieved at u⋆(t,x)=gy(t,x)−a(t,x)∇xVt
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Stochastic Interpolants: A Unifying Framework for Flows and Diffusions
[ABVE23] Michael S Albergo, Nicholas M Boffi, and Eric Vanden-Eijnden. Stochastic interpolants: A unifying framework for flows and diffusions.arXiv preprint arXiv:2303.08797,
work page internal anchor Pith review Pith/arXiv arXiv
-
[2]
[BBCB20] Marc Bocquet, Julien Brajard, Alberto Carrassi, and Laurent Bertino. Bayesian inference of chaotic dynamics by merging data assimilation, machine learning and expectation- maximization.arXiv preprint arXiv:2001.06270,
-
[3]
The Ensemble Schr{\"o}dinger Bridge filter for Nonlinear Data Assimilation
[BS25] Feng Bao and Hui Sun. The ensemble schr{\” o}dinger bridge filter for nonlinear data assimilation.arXiv preprint arXiv:2512.18928,
work page internal anchor Pith review Pith/arXiv arXiv
-
[4]
[BZZ23a] Feng Bao, Guannan Zhang, and Zezhong Zhang. A unified filter method for jointly estimating state and parameters of stochastic dynamical systems via the ensemble score filter.arXiv preprint arXiv:2312.10503,
-
[5]
A score-based non- linear filter for data assimilation
[BZZ23b] Feng Bao, Zezhong Zhang, and Guannan Zhang. A score-based nonlinear filter for data assim- ilation.arXiv preprint arXiv:2306.09282,
-
[6]
Probabilistic forecasting with stochastic interpolants and Föllmer processes
[CGH+24] Yifan Chen, Mark Goldstein, Mengjian Hua, Michael S Albergo, Nicholas M Boffi, and Eric Vanden-Eijnden. Probabilistic forecasting with stochastic interpolants and f\” ollmer pro- PATH INFERENCE UNDER PARTIAL OBSERVATIONS 23 cesses.arXiv preprint arXiv:2403.13724,
-
[7]
Diffusion Posterior Sampling for General Noisy Inverse Problems
[CKM+22] Hyungjin Chung, Jeongsol Kim, Michael T Mccann, Marc L Klasky, and Jong Chul Ye. Diffu- sion posterior sampling for general noisy inverse problems.arXiv preprint arXiv:2209.14687,
work page internal anchor Pith review Pith/arXiv arXiv
-
[8]
[CM25] Heng-Sheng Chang and Prashant G Mehta. Dual filter: A mathematical framework for inference using transformer-like architectures.arXiv preprint arXiv:2505.00818,
-
[9]
[CSC+22] Chen-Hao Chao, Wei-Fang Sun, Bo-Wun Cheng, Yi-Chen Lo, Chia-Che Chang, Yu-Lun Liu, Yu-Lin Chang, Chia-Ping Chen, and Chun-Yi Lee. Denoising likelihood score matching for conditional score-based data generation.arXiv preprint arXiv:2203.14206,
-
[10]
Reflected schr\” odinger bridge for constrained generative modeling.arXiv preprint arXiv:2401.03228,
[DCY+24] Wei Deng, Yu Chen, Nicole Tianjiao Yang, Hengrong Du, Qi Feng, and Ricky TQ Chen. Reflected schr\” odinger bridge for constrained generative modeling.arXiv preprint arXiv:2401.03228,
-
[11]
[DGMO09] Randal Douc, Aur´ elien Garivier, Eric Moulines, and Jimmy Olsson. On the forward filtering backward smoothing particle approximations of the smoothing distribution in general state spaces models.arXiv preprint arXiv:0904.0316,
work page internal anchor Pith review Pith/arXiv arXiv
-
[12]
On neural differential equations
24 [Kid22] Patrick Kidger. On neural differential equations.arXiv preprint arXiv:2202.02435,
-
[13]
A variational approach to sampling in diffusion processes.arXiv preprint arXiv:2405.00126,
[Rag24] Maxim Raginsky. A variational approach to sampling in diffusion processes.arXiv preprint arXiv:2405.00126,
-
[14]
Score-based data assimilation, 2023
[RL23] Fran¸ cois Rozet and Gilles Louppe. Score-based data assimilation.arXiv preprint arXiv:2306.10574,
-
[15]
[TR19] Belinda Tzen and Maxim Raginsky. Neural stochastic differential equations: Deep latent gauss- ian models in the diffusion limit.arXiv preprint arXiv:1905.09883,
-
[16]
Relative arbitrage opportunities in an extended mean field system.arXiv preprint arXiv:2311.02690,
[YI25b] Nicole Tianjiao Yang and Tomoyuki Ichiba. Relative arbitrage opportunities in an extended mean field system.arXiv preprint arXiv:2311.02690,
-
[17]
An optimal control approach to particle filtering.Automatica, 151:110894, 2023
[ZTC23] Qinsheng Zhang, Amirhossein Taghvaei, and Yongxin Chen. An optimal control approach to particle filtering.Automatica, 151:110894, 2023
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.