Physics-Guided Recurrent State-Space Neural Networks for Multi-Step Prediction

Ajay Seth; Manon Kok; Ruiyuan Li

arxiv: 2606.02278 · v1 · pith:YEF47RZ3new · submitted 2026-06-01 · 📡 eess.SY · cs.LG· cs.SY

Physics-Guided Recurrent State-Space Neural Networks for Multi-Step Prediction

Ruiyuan Li , Ajay Seth , Manon Kok This is my paper

Pith reviewed 2026-06-28 13:30 UTC · model grok-4.3

classification 📡 eess.SY cs.LGcs.SY

keywords physics-guided neural networksrecurrent state-space modelsmulti-step predictionhybrid modelingstable traininglimited datapartial physical knowledgesystem identification

0 comments

The pith

A physics-guided recurrent state-space neural network achieves stable multi-step predictions by combining partial physical models with neural feedback structures.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces the PG-RSSNN to address poor long-term forecasts from inaccurate physical state-space models and the data hunger of black-box neural networks. It embeds physics guidance inside a recurrent neural structure that feeds back estimated states, allowing non-saturating activations while avoiding vanishing gradients and training divergence. Experiments on linear systems with noise, a robotic arm, and a cascaded water tank demonstrate better multi-step accuracy than either pure physics or pure neural baselines, even when training data is limited and the embedded physical model is only approximate. A reader would care because the approach shows how incomplete domain knowledge can be turned into reliable forecasts without requiring either perfect equations or massive datasets.

Core claim

The PG-RSSNN maintains stable training behavior and improves multi-step predictions by incorporating recurrent structures to enable the use of non-saturating activation functions in multi-step prediction. It mitigates the vanishing gradients and eliminates the risk of numerical divergence in training seen in existing structures that feed back state estimates. Results across multiple systems with various physical model imperfections show that the proposed PG-RSSNN outperforms black-box neural networks and physics-only models, even with limited training data and when physical models are only partially known.

What carries the argument

The recurrent feedback structure inside the physics-guided state-space neural network that passes state estimates forward while injecting partial physical dynamics at each step.

If this is right

Multi-step predictions improve over both black-box neural networks and physics-only models across tested systems.
Training remains stable when non-saturating activations are used in the recurrent loop.
Performance holds with limited training data.
The method tolerates partially known physical models containing imperfections such as Gaussian noise or unmodeled dynamics.
The approach applies to both linear and nonlinear systems including robotic arms and fluid tanks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same recurrent guidance pattern could be tested on other prediction tasks such as time-series forecasting in energy systems where partial physics is available.
If the physical model mismatch grows beyond the tested levels, additional regularization on the physics injection term might be needed to preserve stability.
The structure suggests a route to embed other forms of domain knowledge, such as conservation laws, directly into the state transition without full model fidelity.

Load-bearing premise

That feeding back state estimates through the recurrent structure will reliably allow non-saturating activations without introducing new instabilities or requiring the physical model to be fully accurate.

What would settle it

If the PG-RSSNN exhibits training divergence or lower multi-step prediction accuracy than the black-box neural network baseline on the cascaded water tank system, the performance claims would not hold.

Figures

Figures reproduced from arXiv: 2606.02278 by Ajay Seth, Manon Kok, Ruiyuan Li.

**Figure 2.** Figure 2: Cascaded tank dataset results for different training [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗

read the original abstract

State-space models are traditionally based on physical knowledge, but multi-step predictions from these physical models can be poor due to model inaccuracy. Black-box deep learning has shown promise as an alternative. However, these methods rely on the availability of large datasets and potentially available physical knowledge is neglected. We propose the PG-RSSNN, a physics-guided recurrent state-space neural network that incorporates recurrent structures to enable the use of non-saturating activation functions in multi-step prediction. It mitigates the vanishing gradients and eliminates the risk of numerical divergence in training seen in existing structures that feed back state estimates. Results across multiple systems with various physical model imperfections, from linear state-space models with Gaussian noise to a robotic arm and a cascaded water tank system, show that the proposed PG-RSSNN maintains stable training behavior, and improves multi-step predictions, as compared with black-box neural networks and physics-only models, even with limited training data and when physical models are only partially known.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

PG-RSSNN gives a targeted recurrent fix for stable multi-step prediction in hybrid physics-ML state-space models on control tasks with scarce data.

read the letter

The paper's core contribution is a recurrent feedback structure inside a physics-guided state-space neural net that lets non-saturating activations be used for multi-step rollouts without the divergence or vanishing-gradient problems that hit earlier hybrid designs.

It tests this on linear systems with noise, a robotic arm, and a cascaded water-tank setup. The results indicate better long-term predictions than either pure physics models or black-box networks when training data is limited and the physical model is only partially accurate. The experiments deliberately vary the degree of model mismatch, which matches the practical setting the method targets.

The architecture change is presented as new relative to the cited prior work on physics-informed networks, and the empirical pattern is consistent with the claimed mechanism. Credit is due for focusing on a narrow but recurring pain point in robotics and process control rather than claiming broad ML advances.

The main limitation is that the abstract gives no quantitative deltas or ablation tables, so the size of the improvement and its sensitivity to hyper-parameters remain unclear until the full numbers are checked. No load-bearing circularity or internal contradiction appears in the description.

This is aimed at control engineers and hybrid-modeling practitioners who already have approximate physics and need better forecasts from small datasets. It is not foundational for general machine learning. The work is coherent on its own terms and the experiments address the stated goal, so it should go to peer review rather than desk rejection.

Referee Report

1 major / 0 minor

Summary. The manuscript proposes the PG-RSSNN, a physics-guided recurrent state-space neural network that incorporates recurrent structures to enable non-saturating activation functions for multi-step prediction. This is claimed to mitigate vanishing gradients and eliminate numerical divergence during training. Empirical results across linear state-space models with Gaussian noise, a robotic arm, and a cascaded water tank system are said to show stable training and improved multi-step predictions relative to black-box neural networks and physics-only models, even under limited training data and partially known physical models.

Significance. If the empirical claims hold with the promised quantitative support, the work could offer a practical route to hybrid physics-data modeling that stabilizes training while exploiting partial domain knowledge, which is valuable for control and system identification tasks where data are scarce. The recurrent feedback mechanism is positioned as directly solving a known instability in existing hybrid architectures.

major comments (1)

Abstract: the central claim of improved multi-step predictions and stable training is stated without any equations, training details, quantitative metrics, ablation studies, or specific performance numbers; the claim cannot be evaluated from the provided text.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their feedback. We address the single major comment below.

read point-by-point responses

Referee: [—] Abstract: the central claim of improved multi-step predictions and stable training is stated without any equations, training details, quantitative metrics, ablation studies, or specific performance numbers; the claim cannot be evaluated from the provided text.

Authors: We agree that the abstract is written at a high level and does not contain quantitative metrics, equations, or training details. The manuscript body provides the PG-RSSNN equations (Section 3), training procedure (Section 4), specific metrics (e.g., multi-step RMSE on the robotic arm and cascaded tank), ablation results, and comparisons to black-box and physics-only baselines (Section 5). To address the concern, we will revise the abstract to include key quantitative results from the experiments. revision: yes

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper proposes the PG-RSSNN architecture and reports empirical results on multiple physical systems (linear SSMs with noise, robotic arm, cascaded tanks) under partial model knowledge and limited data. No derivation chain, fitted parameters renamed as predictions, self-citation load-bearing steps, or ansatz smuggling appear in the abstract or description. The central claims rest on experimental comparisons to black-box NNs and physics-only models rather than any reduction by construction to inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no explicit free parameters, axioms, or invented entities.

pith-pipeline@v0.9.1-grok · 5703 in / 1057 out tokens · 28059 ms · 2026-06-28T13:30:07.102078+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

17 extracted references · 3 canonical work pages · 2 internal anchors

[1]

and Murray, R

˚Astr¨ om, K.J. and Murray, R. (2021). Feedback systems: an introduction for scientists and engineers . Princeton University Press

2021
[2]

Champneys, M., Beintema, G.I., T´ oth, R., Schoukens, M., and Rogers, T.J. (2024). Baseline results for selected nonlinear system identification benchmarks. IFAC- PapersOnLine, 58(15), 474–479

2024
[3]

Chung, J., Gulcehre, C., Cho, K., and Bengio, Y. (2014). Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555

work page internal anchor Pith review Pith/arXiv arXiv 2014
[4]

and Haviland, J

Corke, P. and Haviland, J. (2021). Not your grand- mother’s toolbox–the robotics toolbox reinvented for python. In Proceedings of IEEE International Conference on Robotics and Automation , 11357–11363

2021
[5]

Gedon, D., Wahlstr¨ om, N., Sch¨ on, T.B., and Ljung, L. (2021). Deep state space models for nonlinear system identification. IFAC-PapersOnLine, 54(7), 481–486

2021
[6]

Glorot, X., Bordes, A., and Bengio, Y. (2011). Deep sparse rectifier neural networks. In Proceedings of the 14th International Conference on Artificial Intelligence and Statistics, 315–323

2011
[7]

Gustafsson, F. (2010). Statistical sensor fusion. Studentlit- teratur

2010
[8]

Karpatne, A., Watkins, W., Read, J., and Kumar, V. (2017). Physics-guided neural networks (PGNN): An application in lake temperature modeling. arXiv preprint arXiv:1710.11431

work page arXiv 2017
[9]

Kingma, D.P. (2014). Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980

work page internal anchor Pith review Pith/arXiv arXiv 2014
[10]

Liu, Y., T´ oth, R., and Schoukens, M. (2024). Physics- guided state-space model augmentation using weighted regularized neural networks. IFAC-PapersOnLine, 58(15), 295–300

2024
[11]

Relan, R., Tiels, K., Marconato, A., and Schoukens, J. (2017). An unstructured flexible nonlinear model for the cascaded water-tanks benchmark. IFAC-PapersOnLine, 50(1), 452–457. Sch¨ on, O., G¨ otte, R.S., and Timmermann, J. (2022). Multi- objective physics-guided recurrent neural networks for identifying non-autonomous dynamical systems. IFAC- PapersOnL...

2017
[12]

and No¨ el, J.P

Schoukens, M. and No¨ el, J.P. (2017). Three benchmarks addressing open challenges in nonlinear system identifi- cation. IFAC-PapersOnLine, 50(1), 446–451

2017
[13]

Suykens, J.A., De Moor, B.L., and Vandewalle, J. (1995). Nonlinear system identification using neural state space models, applicable to robust control design.International Journal of Control , 62(1), 129–152

1995
[14]

and Verdult, V

Verhaegen, M. and Verdult, V. (2007).Filtering and System Identification: A Least Squares Approach . Cambridge University Press

2007
[15]

and Orin, D.E

Walker, M.W. and Orin, D.E. (1982). Efficient dynamic computer simulation of robotic mechanisms. Journal of Dynamic Systems, Measurement, and Control , 104(3), 205–211

1982
[16]

Weigand, J., G¨ otz, J., Ulmen, J., and Ruskowski, M. (2022). Dataset and baseline for an industrial robot identification benchmark. In Proceedings of the 6th Workshop on Nonlinear System Identification Benchmarks

2022
[17]

Yu, C., Chen, J., Li, S., and Verhaegen, M. (2020). Identifi- cation of affinely parameterized state–space models with unknown inputs. Automatica, 122, 109271

2020

[1] [1]

and Murray, R

˚Astr¨ om, K.J. and Murray, R. (2021). Feedback systems: an introduction for scientists and engineers . Princeton University Press

2021

[2] [2]

Champneys, M., Beintema, G.I., T´ oth, R., Schoukens, M., and Rogers, T.J. (2024). Baseline results for selected nonlinear system identification benchmarks. IFAC- PapersOnLine, 58(15), 474–479

2024

[3] [3]

Chung, J., Gulcehre, C., Cho, K., and Bengio, Y. (2014). Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555

work page internal anchor Pith review Pith/arXiv arXiv 2014

[4] [4]

and Haviland, J

Corke, P. and Haviland, J. (2021). Not your grand- mother’s toolbox–the robotics toolbox reinvented for python. In Proceedings of IEEE International Conference on Robotics and Automation , 11357–11363

2021

[5] [5]

Gedon, D., Wahlstr¨ om, N., Sch¨ on, T.B., and Ljung, L. (2021). Deep state space models for nonlinear system identification. IFAC-PapersOnLine, 54(7), 481–486

2021

[6] [6]

Glorot, X., Bordes, A., and Bengio, Y. (2011). Deep sparse rectifier neural networks. In Proceedings of the 14th International Conference on Artificial Intelligence and Statistics, 315–323

2011

[7] [7]

Gustafsson, F. (2010). Statistical sensor fusion. Studentlit- teratur

2010

[8] [8]

Karpatne, A., Watkins, W., Read, J., and Kumar, V. (2017). Physics-guided neural networks (PGNN): An application in lake temperature modeling. arXiv preprint arXiv:1710.11431

work page arXiv 2017

[9] [9]

Kingma, D.P. (2014). Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980

work page internal anchor Pith review Pith/arXiv arXiv 2014

[10] [10]

Liu, Y., T´ oth, R., and Schoukens, M. (2024). Physics- guided state-space model augmentation using weighted regularized neural networks. IFAC-PapersOnLine, 58(15), 295–300

2024

[11] [11]

Relan, R., Tiels, K., Marconato, A., and Schoukens, J. (2017). An unstructured flexible nonlinear model for the cascaded water-tanks benchmark. IFAC-PapersOnLine, 50(1), 452–457. Sch¨ on, O., G¨ otte, R.S., and Timmermann, J. (2022). Multi- objective physics-guided recurrent neural networks for identifying non-autonomous dynamical systems. IFAC- PapersOnL...

2017

[12] [12]

and No¨ el, J.P

Schoukens, M. and No¨ el, J.P. (2017). Three benchmarks addressing open challenges in nonlinear system identifi- cation. IFAC-PapersOnLine, 50(1), 446–451

2017

[13] [13]

Suykens, J.A., De Moor, B.L., and Vandewalle, J. (1995). Nonlinear system identification using neural state space models, applicable to robust control design.International Journal of Control , 62(1), 129–152

1995

[14] [14]

and Verdult, V

Verhaegen, M. and Verdult, V. (2007).Filtering and System Identification: A Least Squares Approach . Cambridge University Press

2007

[15] [15]

and Orin, D.E

Walker, M.W. and Orin, D.E. (1982). Efficient dynamic computer simulation of robotic mechanisms. Journal of Dynamic Systems, Measurement, and Control , 104(3), 205–211

1982

[16] [16]

Weigand, J., G¨ otz, J., Ulmen, J., and Ruskowski, M. (2022). Dataset and baseline for an industrial robot identification benchmark. In Proceedings of the 6th Workshop on Nonlinear System Identification Benchmarks

2022

[17] [17]

Yu, C., Chen, J., Li, S., and Verhaegen, M. (2020). Identifi- cation of affinely parameterized state–space models with unknown inputs. Automatica, 122, 109271

2020