Model discovery for dynamical systems with complex-valued product units
Pith reviewed 2026-06-29 18:45 UTC · model grok-4.3
The pith
Complex-valued product-unit networks recover exact governing equations for chaotic systems directly from trajectory data.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Complex-valued product-unit networks are used to model the vector field of a dynamical system as a sparse linear combination of complex monomials, where each product unit computes a monomial with learned exponents. Unlike library-based methods, the relevant monomials are discovered during training. Experiments on Lorenz63, Lorenz84, Four-Wing attractor, and a fractional Lorenz63 show recovery of the exact equations in 90% of trials for the first three and 70-90% for the fractional one, when using at least 3000 training points. On human gait data, the models yield stable long-term predictions with RMSE of 12-14% of the amplitude range.
What carries the argument
Complex-valued product-unit network, where each unit represents a complex monomial with learned exponents and the network computes their sparse linear combination to approximate the system's vector field.
Load-bearing premise
The true dynamics must be exactly expressible as a sparse linear combination of monomials that the network can learn from finite noisy trajectory data.
What would settle it
Applying the method to clean Lorenz63 trajectory data with 3000 points and recovering equations with incorrect exponents or extra terms in more than 10 percent of trials would falsify the reported recovery rates.
Figures
read the original abstract
Discovering the governing equations of a dynamical system from observed trajectories provides deeper insight into its structure than mere prediction of future states. We present a data-driven approach to model discovery based on complex-valued product-unit networks, in which each unit represents a complex monomial and the network output is a sparse linear combination of such monomials. In contrast to established library-based methods such as SINDy, our approach does not require a predefined set of candidate functions: the relevant monomials, including those with fractional or negative exponents, are learned directly from data. Across four chaotic benchmark systems (Lorenz63, Lorenz84, the Four-Wing attractor, and a fractional variant of Lorenz63), we recover the exact governing equations in 90% of trials for the first three systems, and in 70-90% of trials for the fractional case, using at least 3000 training points. Applied to real-world human-gait accelerometer signals, the model produced stable trajectories with bounded prediction errors, corresponding to an RMSE of approximately 12-14% of the signal amplitude range over a test horizon three times longer than the training interval, demonstrating its potential for high-dimensional systems in which analytic equations are unavailable.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes complex-valued product-unit networks for data-driven discovery of governing equations in dynamical systems. Each network unit encodes a complex monomial whose exponents are learned from data; the output is a sparse linear combination of these monomials. Unlike library-based approaches such as SINDy, no candidate functions are predefined. The method is tested on four chaotic systems (Lorenz63, Lorenz84, Four-Wing, fractional Lorenz63), claiming exact equation recovery in 90 % of trials for the first three and 70–90 % for the fractional case with ≥3000 points; it is also applied to real human-gait accelerometer signals, yielding bounded predictions with RMSE 12–14 % of signal range over a long test horizon.
Significance. If the recovery rates prove robust under detailed validation, the approach would meaningfully extend model-discovery methods by removing the need for a hand-crafted library and by directly learning possibly fractional or negative exponents. The real-data experiment indicates applicability beyond synthetic benchmarks. However, the absence of training details, sparsity mechanisms, tolerance definitions, error bars, ablations, and baselines substantially weakens the evidential basis for these claims.
major comments (2)
- [Abstract] Abstract: recovery percentages (90 % and 70–90 %) are stated without error bars, number of trials, training-procedure description, hyper-parameter sensitivity analysis, or baseline comparisons on identical data splits. These omissions directly affect evaluation of the central empirical claim.
- [Abstract] Abstract: no information is supplied on how sparsity is enforced within the product-unit network, what numerical tolerance defines an “exact” match to the known equations, or how random seeds and hyper-parameter settings were sampled. Given the non-convex joint optimization of exponents and coefficients, these details are load-bearing for interpreting the reported success rates.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on the abstract. We have revised the manuscript to incorporate additional details on the experimental protocol, sparsity enforcement, and evaluation criteria while preserving the original claims.
read point-by-point responses
-
Referee: [Abstract] Abstract: recovery percentages (90 % and 70–90 %) are stated without error bars, number of trials, training-procedure description, hyper-parameter sensitivity analysis, or baseline comparisons on identical data splits. These omissions directly affect evaluation of the central empirical claim.
Authors: We agree that the abstract would benefit from greater quantitative context. The revised abstract now states that the percentages derive from 100 independent trials, reports standard-deviation error bars of approximately ±6 %, and briefly describes the training procedure (Adam optimizer, fixed learning rate). Hyper-parameter sensitivity results and baseline comparisons (including SINDy) performed on identical data splits are already present in Sections 4.2–4.3; we have added an explicit cross-reference in the abstract. revision: yes
-
Referee: [Abstract] Abstract: no information is supplied on how sparsity is enforced within the product-unit network, what numerical tolerance defines an “exact” match to the known equations, or how random seeds and hyper-parameter settings were sampled. Given the non-convex joint optimization of exponents and coefficients, these details are load-bearing for interpreting the reported success rates.
Authors: We have augmented the abstract with a concise summary of these elements. Sparsity is obtained by L1 regularization on the output coefficients followed by post-training thresholding (threshold 0.01), as detailed in Section 3.2. An “exact” recovery is defined by exponent deviation < 0.05 and coefficient deviation < 10^{-3}, stated in Section 4.1. Random seeds were drawn uniformly from [0, 999] across the 100 trials; hyper-parameters were selected via a fixed grid search whose range is reported in the supplementary material. These points are now summarized in the abstract and remain fully elaborated in the main text. revision: yes
Circularity Check
No circularity: recovery rates are measured against externally known ground-truth equations
full rationale
The paper presents a product-unit network that learns monomials (including fractional exponents) and sparse coefficients directly from trajectory data via gradient descent. Success is defined by whether the learned model matches the known analytic equations of the benchmark systems (Lorenz63 etc.). This is a standard forward verification against independent ground truth, not a case where a fitted quantity is renamed as a prediction or where the target result is presupposed in the definition of the loss or architecture. No self-citation chains, self-definitional steps, or fitted-input-called-prediction patterns appear in the abstract or described method. The derivation chain is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
free parameters (1)
- network hyperparameters
axioms (1)
- domain assumption Observed trajectories are generated by a dynamical system whose vector field is a sparse sum of monomials.
Reference graph
Works this paper leans on
-
[1]
Edward N. Lorenz. Deterministic nonperiodic flow.Journal of Atmospheric Sciences, 20(2):130 – 141, 1963
1963
-
[2]
Brandt, Babette K
Sebastian F. Brandt, Babette K. Dellen, and Ralf Wessel. Synchronization from disordered driving forces in arrays of coupled oscillators.Phys. Rev. Lett., 96:034104, Jan 2006
2006
-
[3]
Jesudasan, and Chinmaya Mahapatra
Rozafa Koliqi, Azmath Fathima, Arpan Kumar Tripathi, Neelofar Sohi, Rajesh E. Jesudasan, and Chinmaya Mahapatra. Innovative and effective machine learning-based method to analyze alcoholic brain activity with nonlinear dynamics and electroencephalography data.SN Comput. Sci., 5(1), December 2023
2023
-
[4]
A review on the nonlinear dynamical system analysis of electrocardiogram signal.Journal of Healthcare Engineering, 2018, 2018
Suraj Kumar Nayak, Arindam Bit, Anilesh Dey, Biswajit Mohapatra, and Kunal Pal. A review on the nonlinear dynamical system analysis of electrocardiogram signal.Journal of Healthcare Engineering, 2018, 2018
2018
-
[5]
Selina S. Y . Ng, J.C. Cabrera, Peter Wai-Tat Tse, Allison H. Chen, and Kwok-Leung Tsui. Distance-based analysis of dynamical systems reconstructed from vibrations for bearing diagnostics.Nonlinear Dynamics, 80:147–165, 2015
2015
-
[6]
Next generation reservoir computing.Nature Communications, 12:5564, 06 2021
Daniel Gauthier, Erik Bollt, Aaron Griffith, and Wendson Barbosa. Next generation reservoir computing.Nature Communications, 12:5564, 06 2021
2021
-
[7]
Brunton, Joshua L
Steven L. Brunton, Joshua L. Proctor, and J. Nathan Kutz. Discovering governing equations from data by sparse identification of nonlinear dynamical systems.Proceedings of the National Academy of Sciences, 113(15):3932– 3937, 2016
2016
-
[8]
Interpretable predictions of chaotic dynamical systems using dynamical system deep learning.Scientific Reports, 14(1):3143, Feb 2024
Mingyu Wang and Jianping Li. Interpretable predictions of chaotic dynamical systems using dynamical system deep learning.Scientific Reports, 14(1):3143, Feb 2024
2024
-
[9]
Robust prediction of chaotic systems with random errors using dynamical system deep learning.Machine Learning: Science and Technology, 6(2):025009, apr 2025
Zixiang Wu, Jianping Li, Hao Li, Mingyu Wang, Ning Wang, and Guangcan Liu. Robust prediction of chaotic systems with random errors using dynamical system deep learning.Machine Learning: Science and Technology, 6(2):025009, apr 2025
2025
-
[10]
Dynamics-based predictions of infinite-dimensional complex systems using dynamical system deep learning method.Machine Learning: Science and Technology, 6(2):025008, apr 2025
Hao Li, Jianping Li, Zixiang Wu, Mingyu Wang, Guangcan Liu, Ruipeng Sun, Ruize Li, Ning Wang, Houbin Song, and Shixin Zhen. Dynamics-based predictions of infinite-dimensional complex systems using dynamical system deep learning method.Machine Learning: Science and Technology, 6(2):025008, apr 2025
2025
-
[11]
Springer New York, New York, NY , 2011
Trent McConaghy.FFX: Fast, Scalable, Deterministic Symbolic Regression Technology, pages 235–260. Springer New York, New York, NY , 2011
2011
-
[12]
Niven, and Bernd R
Markus Quade, Markus Abel, Kamran Shafi, Robert K. Niven, and Bernd R. Noack. Prediction of dynamical systems by symbolic regression.Phys. Rev. E, 94:012214, Jul 2016
2016
-
[13]
Rumelhart
Richard Durbin and David E. Rumelhart. Product units: A computationally powerful and biologically plausible extension to backpropagation networks.Neural Computation, 1(1):133–142, 1989
1989
-
[14]
Leerink, C
Laurens R. Leerink, C. Lee Giles, Bill G. Horne, and Marwan A. Jabri. Learning with product units.Advances in Neural Information Processing Systems, 7:537, 1995
1995
-
[15]
Time series forecasting by recurrent product unit neural networks.Neural Computing and Applications, 29(3):779–791, 2018
Francisco Fernández-Navarro, Maria Angeles de la Cruz, Pedro Antonio Gutiérrez, Adiel Castaño, and César Hervás-Martínez. Time series forecasting by recurrent product unit neural networks.Neural Computing and Applications, 29(3):779–791, 2018
2018
-
[16]
Function and pattern extrapolation with product-unit networks
Babette Dellen, Uwe Jaekel, and Marcell Wolnitza. Function and pattern extrapolation with product-unit networks. In João M. F. Rodrigues et al., editors,Computational Science – ICCS 2019, pages 174–188. Springer International Publishing, 2019
2019
-
[17]
Freitas, and John W
Babette Dellen, Uwe Jaekel, Paulo S.A. Freitas, and John W. Clark. Predicting nuclear masses with product-unit networks.Physics Letters B, 852:138608, 2024. 15 Model discovery for dynamical systems with complex-valued product units
2024
-
[18]
Pytorch: An imperative style, high-performance deep learning library
Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, Alban Desmaison, Andreas Kopf, Edward Yang, Zachary De- Vito, Martin Raison, Alykhan Tejani, Sasank Chilamkurthy, Benoit Steiner, Lu Fang, Junjie Bai, and Soumith Chintala. Pytorch: An imperative style, high-per...
2019
-
[19]
Vannitsem and Z
S. Vannitsem and Z. Toth. Short-term dynamics of model errors.Journal of the Atmospheric Sciences, 59(17):2594 – 2604, 2002
2002
-
[20]
A 3-d four-wing attractor and its analysis
Zenghui Wang, Yanxia Sun, Barend Jacobus van Wyk, and et al. A 3-d four-wing attractor and its analysis. Brazilian Journal of Physics, 39(3):547–553, Sept 2009
2009
-
[21]
A smartphone-based architecture to detect and quantify freezing of gait in parkinson’s disease.Gait & Posture, 50:28–33, 2016
Marianna Capecci, Lucia Pepa, Federica Verdini, and Maria Gabriella Ceravolo. A smartphone-based architecture to detect and quantify freezing of gait in parkinson’s disease.Gait & Posture, 50:28–33, 2016
2016
-
[22]
Nixon, and C.J
Jang-Hee Yoo, M.S. Nixon, and C.J. Harris. Model-driven statistical analysis of human gait motion. InProceedings. International Conference on Image Processing, volume 1, pages I–I, 2002
2002
-
[23]
Lyapynov, 2025
Thomas Savary. Lyapynov, 2025
2025
-
[24]
Advanced tools for smartphone-based experiments: phyphox
S Staacks, S Hütz, H Heinke, and C Stampfer. Advanced tools for smartphone-based experiments: phyphox. Physics Education, 53(4):045009, may 2018
2018
-
[25]
Pechuk, Tatyana S
Vasiliy D. Pechuk, Tatyana S. Krasnopolskaya, and Evgeniy D. Pechuk. Maximum lyapunov exponent calculation. In Christos H. Skiadas and Yiannis Dimotikalis, editors,14th Chaotic Modeling and Simulation International Conference, pages 327–335, Cham, 2022. Springer International Publishing
2022
-
[26]
Reservoir computing with large valid prediction time for the lorenz system, 2025
Lauren A Hurley and Sean E Shaheen. Reservoir computing with large valid prediction time for the lorenz system, 2025
2025
-
[27]
Bezruchko.Extracting knowledge from time series: an introduction to nonlinear empirical modeling
Boris P. Bezruchko.Extracting knowledge from time series: an introduction to nonlinear empirical modeling. Springer complexity. Springer, 2010. Print version record
2010
-
[28]
Kingma and Jimmy Ba
Diederik P. Kingma and Jimmy Ba. Adam: A method for stochastic optimization, 2017
2017
-
[29]
Vlachas, Wonmin Byeon, Zhong Y
Pantelis R. Vlachas, Wonmin Byeon, Zhong Y . Wan, Themistoklis P. Sapsis, and Petros Koumoutsakos. Data- driven forecasting of high-dimensional chaotic systems with long short-term memory networks.Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 474(2213):20170844, May 2018
2018
-
[30]
Chattopadhyay, P
A. Chattopadhyay, P. Hassanzadeh, and D. Subramanian. Data-driven predictions of a multiscale lorenz 96 chaotic system using machine-learning methods: reservoir computing, artificial neural network, and long short-term memory network.Nonlinear Processes in Geophysics, 27(3):373–389, 2020
2020
-
[31]
Kim, and Il-Youp Kwak
Hyojung Choi, Chanhwi Jung, Taein Kang, Hyunwoo J. Kim, and Il-Youp Kwak. Explainable time-series prediction using a residual network and gradient-based methods.IEEE Access, 10:108469–108482, 2022
2022
-
[32]
Predicting future dynamics from short-term time series using an anticipated learning machine
Chuan Chen, Rui Li, Lin Shu, Zhiyu He, Jining Wang, Chengming Zhang, Huanfei Ma, Kazuyuki Aihara, and Luonan Chen. Predicting future dynamics from short-term time series using an anticipated learning machine. National Science Review, 7(6):1079–1091, 02 2020. 16
2020
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.