pith. machine review for the scientific record. sign in

arxiv: 2605.12248 · v1 · submitted 2026-05-12 · 📊 stat.ME · stat.AP· stat.CO

Recognition: no theorem link

Time-variant reliability using time-dependent surrogate models

Bruno Sudret, Stefano Marelli, Styfen Sch\"ar

Pith reviewed 2026-05-13 03:20 UTC · model grok-4.3

classification 📊 stat.ME stat.APstat.CO
keywords time-variant reliabilitysurrogate modelsNARXfirst-passage probabilitydynamical systemsreduced-order modelingbiased samplingstochastic excitations
0
0 comments X

The pith

Manifold-NARX and functional NARX surrogates, with biased sampling, enable accurate first-passage probability estimates for time-dependent dynamical systems.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops two surrogate modeling approaches for assessing the reliability of dynamical systems under random loads over time. Traditional Monte Carlo methods are too slow for complex models, so the authors create mNARX, which uses a reduced manifold of state variables, and F-NARX, which uses principal components from time windows. Tested on a quarter-car model and a Bouc-Wen oscillator, these methods capture the rare events leading to failure when using biased sampling. This matters because it makes safety analysis feasible for realistic engineering systems where direct simulation is impractical.

Core claim

The mNARX and F-NARX frameworks construct surrogate models for time-variant reliability problems by embedding physical insight into regression formulations on reduced manifolds or principal component features, and when used with biased experimental designs, they accurately capture tail behavior for precise first-passage probability estimates.

What carries the argument

mNARX constructs the surrogate on a reduced-order manifold of auxiliary state variables, while F-NARX extracts principal component features from continuous time windows to handle trajectories.

If this is right

  • First-passage probabilities can be estimated with far fewer model evaluations than standard Monte Carlo for high-dimensional stochastic excitations.
  • The surrogates handle long-memory effects and avoid arbitrary lag selection in discrete time series.
  • Tail behavior of system responses becomes accessible without full exploration of the input space.
  • Both frameworks extend established surrogate techniques like polynomial chaos and Kriging to time-dependent reliability settings.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The methods may generalize to other time-evolving quantities such as cumulative damage or fatigue life under stochastic loading.
  • Automated selection of the manifold dimension or number of principal components could further reduce reliance on expert physical insight.
  • Similar reduced-order trajectory modeling might improve efficiency in related areas like stochastic control or real-time monitoring of dynamical systems.

Load-bearing premise

The chosen reduced manifold or principal components, together with biased sampling, must be sufficient to represent the extreme tail behavior of the true dynamical system without introducing bias in the failure probability estimate.

What would settle it

Direct Monte Carlo simulation on the benchmark systems using the same biased experimental designs would produce failure probability estimates that differ substantially from those given by the mNARX or F-NARX surrogates.

Figures

Figures reproduced from arXiv: 2605.12248 by Bruno Sudret, Stefano Marelli, Styfen Sch\"ar.

Figure 1
Figure 1. Figure 1: Graphical illustration of the input preprocessing step in mNARX, where a high [PITH_FULL_IMAGE:figures/full_fig_p009_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Graphical illustration of the manifold construction in mNARX, where auxiliary [PITH_FULL_IMAGE:figures/full_fig_p010_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Schematic of the Quarter-Car model subjected to stochastic road excitation [PITH_FULL_IMAGE:figures/full_fig_p010_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Graphical depiction of the biased sampling strategy in Section 4.3. In both panels, [PITH_FULL_IMAGE:figures/full_fig_p012_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Selection of out-of-sample response traces for an mNARX model and a classical NARX [PITH_FULL_IMAGE:figures/full_fig_p014_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Empirical PDFs of the QoI y2(t) for different experimental design sizes NED, comparing unbiased and biased sampling strategies. 15 [PITH_FULL_IMAGE:figures/full_fig_p015_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: First-passage probability curves as a function of the threshold level for the quarter car [PITH_FULL_IMAGE:figures/full_fig_p016_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Graphical representation of the feature extraction process at the core of the [PITH_FULL_IMAGE:figures/full_fig_p018_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Left: graphical illustration of the hysteretic behavior of the Bouc-Wen oscillator [PITH_FULL_IMAGE:figures/full_fig_p020_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Graphical depiction of the biased sampling strategy in Section 4.3. In both panels, the [PITH_FULL_IMAGE:figures/full_fig_p021_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Selected out-of-sample traces comparing the ground-truth Bouc-Wen response (solid [PITH_FULL_IMAGE:figures/full_fig_p022_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Response histograms and first-passage probability estimates for the Bouc-Wen [PITH_FULL_IMAGE:figures/full_fig_p023_12.png] view at source ↗
read the original abstract

Time-variant reliability analysis is a critical task for ensuring the safety of engineering dynamical systems subjected to stochastic excitations. However, assessing failure probability for realistic systems with Monte-Carlo simulation-based methods is often computationally intractable due to the high cost of the underlying models and the large number of simulations required. While surrogate models such as polynomial chaos expansions or Kriging are well-established for time-invariant reliability problems, their direct application to time-dependent systems remains challenging. This chapter introduces two advanced surrogate modeling frameworks designed specifically for dynamical systems: manifold-NARX (mNARX) and functional NARX (F-NARX). The mNARX approach constructs the surrogate on a reduced-order manifold of auxiliary state variables, enabling the efficient handling of high-dimensional inputs by embedding physical insight into a regression formulation. Conversely, the F-NARX framework exploits the functional nature of system trajectories, extracting principal component features from continuous time windows to mitigate issues associated with discrete lag selection and long-memory effects. We demonstrate the efficacy of these methods on two benchmark reliability problems: a stochastic quarter-car model and a hysteretic Bouc-Wen oscillator. The results highlight that, when combined with suitably biased experimental designs, both frameworks accurately capture the tail behavior of the system response, enabling precise and efficient estimation of first-passage probabilities.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The manuscript introduces two surrogate modeling frameworks—manifold-NARX (mNARX) and functional NARX (F-NARX)—for time-variant reliability analysis of dynamical systems under stochastic excitation. mNARX embeds physical insight via a reduced-order manifold of auxiliary states, while F-NARX extracts principal-component features from time windows to handle long-memory effects. Both are combined with biased experimental designs and demonstrated on a stochastic quarter-car model and a hysteretic Bouc-Wen oscillator, with the central claim that they accurately capture tail behavior to enable precise, efficient first-passage probability estimation.

Significance. If the surrogates preserve tail statistics without systematic bias, the work would provide a valuable computational advance for engineering reliability problems where direct Monte Carlo is intractable. The emphasis on biased designs targeting extremes is a strength, but the absence of explicit tail-validation metrics leaves the efficiency gain unquantified relative to existing time-dependent surrogate approaches.

major comments (3)
  1. [Abstract] Abstract: the claim that the frameworks 'accurately capture the tail behavior' and enable 'precise' first-passage probability estimates is load-bearing yet unsupported by any reported quantitative error measures (e.g., relative error, confidence intervals, or convergence rates) against brute-force Monte Carlo on the benchmark tails; visual agreement alone does not establish preservation of rare-event statistics.
  2. [Introduction / Methods] The reduced-manifold construction in mNARX and the principal-component extraction in F-NARX are presented as sufficient to represent extreme responses, but no analysis is given showing that information relevant to rare crossings (high-frequency content or specific nonlinear interactions) is retained after dimensionality reduction; this directly affects whether the biased-training claim holds without bias in the failure probability.
  3. [Numerical Examples] Benchmark results: while two systems are shown, the manuscript supplies no table or figure reporting the estimated first-passage probabilities with error bars or direct comparison to reference Monte Carlo tails, undermining the assertion of 'precise' estimation.
minor comments (2)
  1. [Methods] Notation for the auxiliary states in mNARX and the time-window length in F-NARX should be defined explicitly at first use to avoid ambiguity in the regression formulation.
  2. [Experimental Design] The description of the biased experimental design lacks a precise statement of the biasing distribution or importance-sampling weights; this detail is needed for reproducibility.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive comments. We address each major comment below and will incorporate revisions to strengthen the quantitative support for our claims.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the claim that the frameworks 'accurately capture the tail behavior' and enable 'precise' first-passage probability estimates is load-bearing yet unsupported by any reported quantitative error measures (e.g., relative error, confidence intervals, or convergence rates) against brute-force Monte Carlo on the benchmark tails; visual agreement alone does not establish preservation of rare-event statistics.

    Authors: We agree that the abstract claim requires stronger quantitative backing. In the revised manuscript we will add relative errors, confidence intervals from repeated surrogate runs, and convergence diagnostics for the first-passage probabilities, all benchmarked directly against brute-force Monte Carlo on the reported tails. revision: yes

  2. Referee: [Introduction / Methods] The reduced-manifold construction in mNARX and the principal-component extraction in F-NARX are presented as sufficient to represent extreme responses, but no analysis is given showing that information relevant to rare crossings (high-frequency content or specific nonlinear interactions) is retained after dimensionality reduction; this directly affects whether the biased-training claim holds without bias in the failure probability.

    Authors: The manifold and PCA constructions are chosen to retain the dominant dynamics that govern extreme excursions, as confirmed by the matching response distributions in the numerical examples. To address the concern explicitly, we will add a supplementary comparison of power spectra and extreme-value statistics between the full and reduced representations. revision: yes

  3. Referee: [Numerical Examples] Benchmark results: while two systems are shown, the manuscript supplies no table or figure reporting the estimated first-passage probabilities with error bars or direct comparison to reference Monte Carlo tails, undermining the assertion of 'precise' estimation.

    Authors: We accept that a direct quantitative table is necessary. The revised version will contain a new table (and updated figure) that lists the surrogate-based first-passage probability estimates together with error bars and side-by-side Monte Carlo reference values for both benchmark systems. revision: yes

Circularity Check

0 steps flagged

No circularity in surrogate construction or tail-probability estimation

full rationale

The paper defines mNARX via reduced-order manifold regression on auxiliary states and F-NARX via principal-component features extracted from time windows, then trains both on simulation data from the quarter-car and Bouc-Wen benchmarks. First-passage probabilities are subsequently estimated from the trained surrogates on independent test trajectories. This chain does not reduce any claimed prediction to the fitting inputs by construction, nor does it rely on self-citations, uniqueness theorems, or ansatzes imported from prior author work. The central performance claim therefore remains externally falsifiable against the true dynamical models.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on the assumption that the reduced-order manifold or functional features preserve tail statistics and that biased experimental designs can be chosen without knowledge of the failure region. No free parameters are explicitly named in the abstract, but the biased design and manifold dimension are implicit choices.

axioms (2)
  • domain assumption The dynamical system trajectories lie on a low-dimensional manifold that can be identified from auxiliary state variables.
    Invoked to justify the mNARX construction for high-dimensional inputs.
  • domain assumption Principal component features extracted from time windows capture the relevant long-memory behavior for tail estimation.
    Basis for the F-NARX approach.

pith-pipeline@v0.9.0 · 5531 in / 1326 out tokens · 55674 ms · 2026-05-13T03:20:55.193305+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

28 extracted references · 28 canonical work pages

  1. [1]

    Billings, S. A. (2013). Nonlinear system identification: NARMAX methods in the time, frequency, and spatio-temporal domains . Chichester, West Sussex, United Kingdom: John Wiley & Sons, Inc

  2. [2]

    Blatman, G. and B. Sudret (2011). Principal component analysis and least angle regression in spectral stochastic finite element analysis. In Proc. 11th Int. Conf. on Applications of Stat. and Prob. in Civil Engineering (ICASP11), Zurich, Switzerland

  3. [3]

    Luo, and Y

    Du, W., Y. Luo, and Y. Wang (2019). Time-variant reliability analysis using the parallel subset simulation. Reliability Engineering & System Safety\/ 182 , 250--257

  4. [4]

    Gayton, and M

    Echard, B., N. Gayton, and M. Lemaire (2011). AK-MCS: An active learning reliability method combining Kriging and Monte Carlo Simulation . Structural Safety\/ 33\/ (2), 145--154

  5. [5]

    Hastie, I

    Efron, B., T. Hastie, I. Johnstone, and R. Tibshirani (2004). Least angle regression . The Annals of Statistics\/ 32\/ (2), 407 -- 499

  6. [6]

    Ghanem, R. and P. Spanos (2003). Stochastic F inite E lements: A S pectral A pproach\/ (2nd ed.). Courier D over P ublications, Mineola

  7. [7]

    Hu, Z. and X. Du (2015). First order reliability method for time-variant problems using series expansions. Structural and Multidisciplinary Optimization\/ 51\/ (1), 1--21

  8. [8]

    Hu, Z. and S. Mahadevan (2016). A single-loop kriging surrogate modeling for time-dependent reliability analysis. Journal of Mechanical Design\/ 138\/ (6), 061406

  9. [9]

    Jolliffe, I. T. (2002). Principal component analysis\/ (2 ed.). Springer Series in Statistics. New York: Springer Verlag (Second Edition)

  10. [10]

    Le Ma \^ i tre, O. P., L. Mathelin, O. M. Knio, and M. Y. Hussaini (2010). Asynchronous time integration for polynomial chaos expansion of uncertain periodic dynamics . Discrete and Continuous Dynamical Systems\/ 28\/ (1), 199--226

  11. [11]

    Li, J. and J. Chen (2004). Probability density evolution method for dynamic response analysis of structures with uncertain parameters. Computational Mechanics\/ 34\/ (5), 400--409

  12. [12]

    Mai, C.-V. (2016). Polynomial chaos expansions for uncertain dynamical systems. Applications in earthquake engineering . Ph.\ D. thesis, ETH Z \"u rich, Switzerland

  13. [13]

    Mai, C.-V., M. D. Spiridonakos, E. Chatzi, and B. Sudret (2016). Surrogate modeling for stochastic dynamical systems by combining nonlinear autoregressive with exogeneous input models and polynomial chaos expansions. International Journal of Uncertainty Quantification\/ 6\/ (4), 313--339

  14. [14]

    Mai, C. V. and B. Sudret (2017). Surrogate models for oscillatory systems using sparse polynomial chaos expansions and stochastic time warping. SIAM/ASA Journal on Uncertainty Quantification\/ 5\/ (1), 540--571

  15. [15]

    Marelli, S. and B. Sudret (2018). An active-learning algorithm that combines sparse polynomial chaos expansions and bootstrap for structural reliability analysis. Structural Safety\/ 75 , 67--74

  16. [16]

    Marelli, and B

    Moustapha, M., S. Marelli, and B. Sudret (2022). Active learning for structural reliability: S urvey, general framework and benchmark. Structural Safety\/ 96 , 102174

  17. [17]

    Rao, K. R. and P. Yip (1990). Discrete Cosine Transform: Algorithms, Advantages, Applications . Boston: Academic Press

  18. [18]

    Rezaeian, S. and A. Der Kiureghian (2010). Simulation of synthetic ground motions for specified earthquake and site characteristics . Earthquake Engineering & Structural Dynamics\/ 39\/ (10), 1155--1180

  19. [19]

    Santner, T. J., B. J. Williams, and W. I. Notz (2003). The D esign and A nalysis of C omputer E xperiments . Springer, New York

  20. [20]

    Marelli, and B

    Sch\"ar, S., S. Marelli, and B. Sudret (2024). Emulating the dynamics of complex systems using autoregressive models on manifolds (mNARX) . Mechanical Systems and Signal Processing\/ 208 , 110956

  21. [21]

    Marelli, and B

    Sch \"a r, S., S. Marelli, and B. Sudret (2025). Surrogate modeling with functional nonlinear autoregressive models ( F -NARX) . Reliability Engineering & System Safety\/ , 111276

  22. [22]

    Marelli, and B

    Sch \"a r, S., S. Marelli, and B. Sudret (2026). mNARX+: A surrogate model for complex dynamical systems using manifold-NARX and automatic feature selection . Computer Methods in Applied Mechanics and Engineering\/ 449 , 118550

  23. [23]

    Song, Z., M. A. Valdebenito, S. Sch \"a r, S. Marelli, B. Sudret, and M. G. R. Faes (2026). Probabilistic function-on-function nonlinear autoregressive model for emulation and reliability analysis of dynamical systems. arXiv preprint 2602.01929, under revision

  24. [24]

    Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological)\/ 58\/ (1), 267--288

  25. [25]

    Cao, and Z

    Wang, J., R. Cao, and Z. Sun (2021). Importance sampling for time-variant reliability analysis. IEEE Access\/ 9 , 20933--20941

  26. [26]

    Wu, H. and X. Du (2022, 04). Envelope method for time- and space-dependent reliability prediction. ASCE-ASME Journal of Risk and Uncertainty in Engineering Systems Part B: Mechanical Engineering\/ 8\/ (4), 041201

  27. [27]

    Xiu, D. and G. E. Karniadakis (2002, January). The Wiener-Askey polynomial chaos for stochastic differential equations . SIAM Journal on Scientific Computing\/ 24\/ (2), 619--644

  28. [28]

    Zhang, D., X. Han, C. Jiang, J. Liu, and Q. Li (2017). Time-dependent reliability analysis through response surface method. Journal of Mechanical Design\/ 139\/ (4), 041404