pith. sign in

arxiv: 2606.02623 · v1 · pith:4WDZTI7Enew · submitted 2026-05-29 · 💻 cs.NE · cs.AI· cs.LG

Oscillatory State-Space Models as Inductive Biases for Physics-Informed Neural PDE Solvers

Pith reviewed 2026-06-28 20:08 UTC · model grok-4.3

classification 💻 cs.NE cs.AIcs.LG
keywords physics-informed neural networksstate-space modelsoscillatory dynamicsPDE solversspectral methodsinductive biasestime-dependent PDEshigh-dimensional problems
0
0 comments X

The pith

Oscillatory state-space models for temporal evolution in PINNs enable closed-form spatial differentiation and consistent boundary conditions while improving accuracy and cutting memory versus sequence models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a physics-informed neural network that replaces general sequence models with linear-oscillator state-space dynamics to capture the time evolution of PDE solutions. This temporal component is paired with a PDE-aware spectral basis in space. The combination permits closed-form differentiation in space and straightforward enforcement of boundary conditions. Evaluations on forward, inverse, and high-dimensional PDE problems, including cases with up to 100 spatial dimensions, show gains in accuracy alongside lower memory use than recent sequence-model PINN baselines. The work therefore argues that structured dynamical priors aligned with PDE modal structure can make neural solvers both more accurate and more scalable.

Core claim

A PINN architecture that uses linear-oscillator-based state-space dynamics for temporal evolution together with a PDE-aware spectral basis in space achieves closed-form spatial differentiation, consistent boundary-condition enforcement, higher accuracy, and lower memory consumption than sequence-model-based PINN approaches when applied to forward, inverse, and high-dimensional time-dependent PDE problems up to 100 spatial dimensions.

What carries the argument

Linear-oscillator state-space model for temporal evolution combined with PDE-aware spectral basis for spatial representation, which together supply the structured inductive bias.

If this is right

  • Closed-form spatial differentiation becomes available without numerical approximation.
  • Boundary conditions can be enforced consistently across the domain.
  • Accuracy improves on both forward and inverse PDE problems relative to sequence-model baselines.
  • Memory requirements scale more favorably with sequence length and resolution.
  • The method remains applicable to problems with up to 100 spatial dimensions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same oscillator prior could be tested on time-dependent systems outside the PDE setting, such as ODE networks or control problems.
  • Replacing the linear oscillator with a nonlinear state-space variant might extend the approach to problems with stronger nonlinear temporal dynamics.
  • Lower memory footprints could support longer-time or ensemble simulations that current sequence models cannot reach.
  • The spectral spatial basis might combine with other temporal priors, such as Hamiltonian or symplectic structures, to create further physics-aligned architectures.

Load-bearing premise

The temporal evolution of the target PDE solutions can be represented by linear-oscillator state-space dynamics without substantial loss of fidelity.

What would settle it

A time-dependent PDE whose solution exhibits strongly nonlinear or chaotic temporal behavior where the oscillatory state-space model produces lower accuracy or higher memory use than a comparable sequence-model PINN baseline.

Figures

Figures reproduced from arXiv: 2606.02623 by Abhishek Chandra, Taniya Kapoor.

Figure 1
Figure 1. Figure 1: Latent dynamics on convection (top) and wave (bottom). Columns: solution heatmap, [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: OSSM-PINN architecture: The initial condition is encoded into an oscilla￾tory LinOSS latent state, whose temporal roll￾out produces modal coefficients through an MLP decoder. Coefficients are combined with spatial basis and boundary factor to form the spatio-temporal solution, which is trained through physics-informed loss. This section introduces OSSM-PINNs for solv￾ing time-dependent PDEs. Let Ω ⊂ R d be… view at source ↗
Figure 3
Figure 3. Figure 3: Convection (β = 50): predicted u(x, t) fields and absolute errors for each method. 0.0 2.5 5.0 x 0.0 0.4 0.8 t Reference u(x, t) 0.8 0.0 0.8 0.0 0.4 0.8 t Predicted u(x, t) 0.0 2.5 5.0 x 0.0 0.4 0.8 t Absolute error 0.8 0.0 0.8 0.0 0.5 1.0 0.0 0.4 0.8 t Predicted u(x, t) 0.0 2.5 5.0 x 0.0 0.4 0.8 t Absolute error 0.8 0.0 0.8 0.0 0.5 1.0 0.0 0.4 0.8 t Predicted u(x, t) 0.0 2.5 5.0 x 0.0 0.4 0.8 t Absolute e… view at source ↗
Figure 4
Figure 4. Figure 4: Reaction: predicted u(x, t) fields and absolute errors for each method. benchmarks, with the largest gains on high-frequency convection, wave propagation, and Euler– Bernoulli beam problems. These results indicate that the oscillator–spectral factorization is effective across both first- and second-order-in-time PDEs, as well as high-order spatial operators. Figures 3 and 4, together with SM §E Figures 12,… view at source ↗
Figure 5
Figure 5. Figure 5: Frequency-domain comparison on the wave equation: predictions and Fourier spectra. 0 25 50 75 100 angular frequency ! 0.0 0.4 0.8 j^htopj conv ¯=50 conv ¯=100 wave EB 10 1 10 2 d o min a nt ! analytical ! OSSM-IM (FFT) OSSM-IMEX (FFT) analytical (bar) OSSM-IM (bar) OSSM-IMEX (bar) [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗
Figure 7
Figure 7. Figure 7: SST inverse at t = 23 mo: reference (top), OSSM-IM prediction (mid), absolute error (bot). Full re￾sults in [PITH_FULL_IMAGE:figures/full_fig_p008_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Schrödinger 100D: |ψ(x1, . . . , x100, t)| 2 at t = π/2 showing a slice in (x1, x2, x3). Reference (left), OSSM￾IM prediction (center), absolute error (right). L log Lip =37:9 PINNsFormer L log Lip =16:7 PINNMamba L log Lip =13:2 OSSM-PINN-IMEX [PITH_FULL_IMAGE:figures/full_fig_p009_8.png] view at source ↗
Figure 10
Figure 10. Figure 10: Forward-problem benchmark overview: key challenge and ground-truth solution for each [PITH_FULL_IMAGE:figures/full_fig_p027_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Benchmark overview (inverse, geometry, high-dimensional, problem-adapted-basis). [PITH_FULL_IMAGE:figures/full_fig_p028_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Wave equation: predicted u(x, t) fields and absolute errors for each method. 0 10 20 x 0.0 0.4 0.8 t Reference u(x, t) 0.8 0.0 0.8 0.0 0.4 0.8 t Predicted u(x, t) 0 10 20 x 0.0 0.4 0.8 t Absolute error 0.8 0.0 0.8 0 4 8 0.0 0.4 0.8 t Predicted u(x, t) 0 10 20 x 0.0 0.4 0.8 t Absolute error 0.8 0.0 0.8 0 4 8 0.0 0.4 0.8 t Predicted u(x, t) 0 10 20 x 0.0 0.4 0.8 t Absolute error 0.8 0.0 0.8 0 4 8 0.0 0.4 0.… view at source ↗
Figure 13
Figure 13. Figure 13: Euler–Bernoulli beam (extended domain [0, 8π]): predicted u(x, t) fields and absolute errors for each method. 0.0 2.5 5.0 x 0.0 0.4 0.8 t Reference u(x, t) 0.8 0.0 0.8 0.0 0.4 0.8 t Predicted u(x, t) 0.0 2.5 5.0 x 0.0 0.4 0.8 t Absolute error 0.8 0.0 0.8 0.0 0.4 0.8 0.0 0.4 0.8 t Predicted u(x, t) 0.0 2.5 5.0 x 0.0 0.4 0.8 t Absolute error 0.8 0.0 0.8 0.0 0.4 0.8 0.0 0.4 0.8 t Predicted u(x, t) 0.0 2.5 5.… view at source ↗
Figure 14
Figure 14. Figure 14: Convection (β = 100): predicted u(x, t) fields and absolute errors for each method. 29 [PITH_FULL_IMAGE:figures/full_fig_p029_14.png] view at source ↗
Figure 15
Figure 15. Figure 15: Euler–Bernoulli beam (classical, [0, 2π]): predicted u(x, t) fields and absolute errors for each method. 30 [PITH_FULL_IMAGE:figures/full_fig_p030_15.png] view at source ↗
Figure 16
Figure 16. Figure 16: Latent dynamics on all benchmarks in addition to problems shown in Figure 1. Columns: [PITH_FULL_IMAGE:figures/full_fig_p031_16.png] view at source ↗
Figure 17
Figure 17. Figure 17: Frequency-domain comparison on convection ( [PITH_FULL_IMAGE:figures/full_fig_p032_17.png] view at source ↗
Figure 18
Figure 18. Figure 18: Frequency-domain comparison on convection ( [PITH_FULL_IMAGE:figures/full_fig_p032_18.png] view at source ↗
Figure 19
Figure 19. Figure 19: Frequency-domain comparison on reaction at the final time: spatial Fourier magnitude [PITH_FULL_IMAGE:figures/full_fig_p032_19.png] view at source ↗
Figure 20
Figure 20. Figure 20: Frequency-domain comparison on Euler–Bernoulli beam (classical, [PITH_FULL_IMAGE:figures/full_fig_p033_20.png] view at source ↗
Figure 21
Figure 21. Figure 21: Frequency-domain comparison on extended Euler–Bernoulli at the final time: spatial [PITH_FULL_IMAGE:figures/full_fig_p033_21.png] view at source ↗
Figure 22
Figure 22. Figure 22: Loss-landscape on convection (β = 100) along top-2 Hessian eigenvectors. Smaller L log Lip indicates a smoother basin. 33 [PITH_FULL_IMAGE:figures/full_fig_p033_22.png] view at source ↗
Figure 23
Figure 23. Figure 23: KdV inverse problem: predicted u(x, t) fields and absolute errors. OSSM-PINN recovers both coefficients (λ1, λ2) to < 0.1% relative error. Reference t = 6 mo t = 12 mo t = 18 mo t = 23 mo 1 0 1 T (°C) OSSM-IM: Pred 1 0 1 T (°C) OSSM-IM: |err| 0.00 0.01 0.02 |err| (°C) OSSM-IMEX: Pred 1 0 1 T (°C) OSSM-IMEX: |err| 0.00 0.01 0.02 |err| (°C) [PITH_FULL_IMAGE:figures/full_fig_p034_23.png] view at source ↗
Figure 24
Figure 24. Figure 24: SST 2D advection–diffusion inverse problem: predicted temperature [PITH_FULL_IMAGE:figures/full_fig_p034_24.png] view at source ↗
Figure 25
Figure 25. Figure 25: KdV coefficient convergence during training. Dashed line: true parameter value. [PITH_FULL_IMAGE:figures/full_fig_p034_25.png] view at source ↗
Figure 26
Figure 26. Figure 26: SST coefficient convergence during training ( [PITH_FULL_IMAGE:figures/full_fig_p035_26.png] view at source ↗
Figure 27
Figure 27. Figure 27: Frequency-domain validation of KdV inverse recovery at final time: spatial Fourier [PITH_FULL_IMAGE:figures/full_fig_p035_27.png] view at source ↗
Figure 28
Figure 28. Figure 28: Problem-adapted basis functions. 6 0 6 x 0.0 1.5 3.0 t Reference u(x, t) 0.6 0.0 0.6 0.0 1.5 3.0 t Predicted u(x, t) 6 0 6 x 0.0 1.5 3.0 t Absolute error 0.6 0.0 0.6 0.000 0.015 0.030 0.0 1.5 3.0 t Predicted u(x, t) 6 0 6 x 0.0 1.5 3.0 t Absolute error 0.6 0.0 0.6 0.000 0.015 0.030 0.0 1.5 3.0 t Predicted u(x, t) 6 0 6 x 0.0 1.5 3.0 t Absolute error 0.6 0.0 0.6 0.000 0.015 0.030 0.0 1.5 3.0 t Predicted u(… view at source ↗
Figure 29
Figure 29. Figure 29: QHO 1D: predicted fields with Fourier vs. Hermite basis. [PITH_FULL_IMAGE:figures/full_fig_p036_29.png] view at source ↗
Figure 30
Figure 30. Figure 30: Pöschl–Teller well: predicted fields with Fourier vs. PT basis. [PITH_FULL_IMAGE:figures/full_fig_p037_30.png] view at source ↗
Figure 31
Figure 31. Figure 31: Taylor–Green vortex 2D (Re = 100): predicted velocity components u(x, y), v(x, y) and pressure p(x, y), with absolute errors. 0.0 0.4 0.8 y Pred t=0.1 0.0 0.4 0.8 x 0.0 0.4 0.8 y |err| t=0.1 0.5 0.0 0.5 0.000 0.004 0.008 0.0 0.4 0.8 y Pred t=0.5 0.0 0.4 0.8 x 0.0 0.4 0.8 y |err| t=0.5 0.5 0.0 0.5 0.000 0.004 0.008 0.0 0.4 0.8 y Pred t=0.9 0.0 0.4 0.8 x 0.0 0.4 0.8 y |err| t=0.9 0.5 0.0 0.5 0.000 0.004 0.0… view at source ↗
Figure 32
Figure 32. Figure 32: Heat equation on a triangular domain: predicted [PITH_FULL_IMAGE:figures/full_fig_p037_32.png] view at source ↗
Figure 33
Figure 33. Figure 33: Schrödinger 5D: |ψ(x1, . . . , x5, t)| 2 at t = π/2, showing a slice in (x1, x2, x3). Reference, OSSM-IM prediction, and absolute error. Reference 0.616 0.624 0.632 OSSM-IM Pred 0.616 0.624 0.632 OSSM-IM |err| 0.0000 0.0002 0.0004 OSSM-IMEX Pred 0.616 0.624 0.632 OSSM-IMEX |err| 0.0000 0.0002 0.0004 Schrödinger 100D --- jÃj2 on three faces of the inner cube [¼=8; 7¼=8]3 ½ [0; ¼]3 at t = 1:57; remaining 97… view at source ↗
Figure 34
Figure 34. Figure 34: Schrödinger 100D: |ψ(x1, . . . , x100, t)| 2 at t = π/2, showing a slice in (x1, x2, x3). Reference, OSSM-IM prediction, and absolute error. 38 [PITH_FULL_IMAGE:figures/full_fig_p038_34.png] view at source ↗
Figure 35
Figure 35. Figure 35: Training-loss histories for OSSM-PINN-IMEX (three seeds). [PITH_FULL_IMAGE:figures/full_fig_p039_35.png] view at source ↗
Figure 36
Figure 36. Figure 36: Training-loss histories for OSSM-PINN-IM (three seeds). [PITH_FULL_IMAGE:figures/full_fig_p040_36.png] view at source ↗
Figure 37
Figure 37. Figure 37: Seed stability: mean rMAE ±1 std over three seeds for seven benchmarks. 40 [PITH_FULL_IMAGE:figures/full_fig_p040_37.png] view at source ↗
read the original abstract

Solving time-dependent partial differential equations (PDEs) is an important problem in computational science and engineering. Physics-informed neural networks (PINNs) learn PDE solutions from governing equations. However, accurately capturing temporal evolution remains challenging. Recent sequence-model-based approaches parameterize time evolution using general-purpose sequence models, which capture temporal dependencies but do not explicitly encode the structured dynamics of PDE solutions. In addition, their memory requirements can scale unfavorably with sequence length and resolution, limiting applicability in large-scale or high-dimensional settings. This work introduces a PINN approach that incorporates oscillatory state-space dynamics to represent the modal structure of PDE solutions. The proposed method leverages a linear-oscillator-based temporal evolution, together with a PDE-aware spectral basis in space. This design enables closed-form spatial differentiation and consistent enforcement of boundary conditions. The method is evaluated on forward, inverse, and high-dimensional PDE problems, including cases up to 100 spatial dimensions. The results show improved accuracy and reduced memory usage compared to recent sequence-model-based PINN approaches. Overall, this work highlights the benefits of incorporating structured dynamical priors into the temporal evolution of neural PDE solvers and suggests designing more physics-aligned and computationally efficient PINN architectures.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The manuscript proposes a physics-informed neural network (PINN) architecture that uses oscillatory state-space models to parameterize temporal evolution of PDE solutions, paired with a PDE-aware spectral basis for spatial discretization. This enables closed-form spatial derivatives and boundary condition enforcement. The approach is evaluated on forward, inverse, and high-dimensional (up to 100D) PDE problems and is reported to outperform recent sequence-model-based PINNs in accuracy while using less memory.

Significance. If the empirical gains hold under rigorous verification, the work would demonstrate the value of embedding structured linear dynamical priors into PINN temporal modules, offering a route to scalable solvers for high-dimensional time-dependent PDEs where general sequence models become memory-intensive.

major comments (2)
  1. [Abstract] Abstract: The central claim that linear-oscillator state-space dynamics represent the modal temporal evolution 'without significant loss of fidelity' for the target PDEs is load-bearing, yet the abstract provides no indication of how the model is modified or regularized when the underlying PDE exhibits damping, exponential decay, or strong nonlinearity (e.g., parabolic or chaotic regimes).
  2. [Abstract] Abstract: The reported accuracy and memory improvements are presented without reference to specific PDE families tested, sequence lengths, spatial resolutions, or quantitative baselines (error bars, number of runs), making it impossible to assess whether the gains are robust or confined to pre-selected oscillatory problems.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed comments on the abstract. We respond point by point below and will revise the abstract for clarity where appropriate.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The central claim that linear-oscillator state-space dynamics represent the modal temporal evolution 'without significant loss of fidelity' for the target PDEs is load-bearing, yet the abstract provides no indication of how the model is modified or regularized when the underlying PDE exhibits damping, exponential decay, or strong nonlinearity (e.g., parabolic or chaotic regimes).

    Authors: The manuscript positions the oscillatory state-space model as an inductive bias specifically for PDEs whose solutions exhibit modal oscillatory structure (see Introduction and Section 3). No explicit modification or regularization for damping/strong nonlinearity is introduced because the target problems are those where the linear oscillator prior aligns with the physics; applicability outside this regime is discussed as a limitation in the conclusion. We agree the abstract should better delimit scope and will revise it to state that the approach targets oscillatory modal evolution. revision: yes

  2. Referee: [Abstract] Abstract: The reported accuracy and memory improvements are presented without reference to specific PDE families tested, sequence lengths, spatial resolutions, or quantitative baselines (error bars, number of runs), making it impossible to assess whether the gains are robust or confined to pre-selected oscillatory problems.

    Authors: The abstract is intentionally high-level; concrete PDE families (wave, Schrödinger, etc.), sequence lengths, resolutions, and quantitative results (including error bars over multiple runs) appear in Section 4 and the associated tables/figures. We acknowledge that the abstract could better signal the breadth of evaluation and will revise it to name example PDE families and note that results include statistical quantification over repeated trials. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical method proposal with independent evaluation

full rationale

The paper introduces an oscillatory state-space model as an inductive bias for PINNs, combined with a spectral spatial basis. The abstract and description frame this as an architectural choice evaluated empirically on forward/inverse/high-dimensional PDE tasks, with reported gains in accuracy and memory. No equations, fitting procedures, or derivation steps are presented that reduce a claimed prediction or result to a fitted input, self-definition, or self-citation chain. The central claim remains an empirical improvement over sequence-model baselines rather than a self-referential derivation. This is a standard self-contained proposal; no load-bearing step collapses by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract supplies no explicit free parameters, axioms, or invented entities; the central claim rests on the unstated premise that linear oscillators capture the modal structure of the target PDEs.

pith-pipeline@v0.9.1-grok · 5749 in / 1099 out tokens · 31904 ms · 2026-06-28T20:08:33.935453+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

51 extracted references · 29 canonical work pages · 1 internal anchor

  1. [1]

    , TITLE =

    Lawrence C. Evans.Partial differential equations, volume 19 ofGraduate Studies in Mathemat- ics. American Mathematical Society, 2nd edition, 2010. doi: 10.1090/gsm/019

  2. [2]

    Maziar Raissi, Paris Perdikaris, and George Em Karniadakis. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations.Journal of Computational Physics, 378:686–707, 2019. doi: 10.1016/j.jcp.2018.10.045

  3. [3]

    E., Kevrekidis, I

    George Em Karniadakis, Ioannis G. Kevrekidis, Lu Lu, Paris Perdikaris, Sifan Wang, and Liu Yang. Physics-informed machine learning.Nature Reviews Physics, 3(6):422–440, 2021. doi: 10.1038/s42254-021-00314-5

  4. [4]

    Molina Catricheo, Fabrice Lambert, Julien Salomon, and Elwin van’t Wout

    Constanza A. Molina Catricheo, Fabrice Lambert, Julien Salomon, and Elwin van’t Wout. Mod- eling global surface dust deposition using physics-informed neural networks.Communications Earth & Environment, 5(1):778, 2024. doi: 10.1038/s43247-024-01942-2

  5. [5]

    PhD thesis, University of Oxford, 2022

    Benjamin Moseley.Physics-informed machine learning: From concepts to real-world applica- tions. PhD thesis, University of Oxford, 2022

  6. [6]

    Aditi Krishnapriyan, Amir Gholami, Shandian Zhe, Robert Kirby, and Michael W. Mahoney. Characterizing possible failure modes in physics-informed neural net- works. InAdvances in Neural Information Processing Systems, volume 34, pages 26548–26560, 2021. URL https://proceedings.neurips.cc/paper/2021/hash/ df438e5206f31600e6ae4af72f2725f1-Abstract.html

  7. [7]

    Understanding and mitigating gradient flow pathologies in physics-informed neural networks.SIAM Journal on Scientific Computing, 43 (5):A3055–A3081, 2021

    Sifan Wang, Yujun Teng, and Paris Perdikaris. Understanding and mitigating gradient flow pathologies in physics-informed neural networks.SIAM Journal on Scientific Computing, 43 (5):A3055–A3081, 2021. doi: 10.1137/20M1318043

  8. [8]

    When and why PINNs fail to train: A neural tangent kernel perspective.Journal of Computational Physics, 449:110768, 2022

    Sifan Wang, Xinling Yu, and Paris Perdikaris. When and why PINNs fail to train: A neural tangent kernel perspective.Journal of Computational Physics, 449:110768, 2022. doi: 10.1016/ j.jcp.2021.110768

  9. [9]

    Physics-informed neu- ral networks for solving forward and inverse problems in complex beam systems.IEEE Transactions on Neural Networks and Learning Systems, 35(5):5981–5995, 2023

    Taniya Kapoor, Hongrui Wang, Alfredo Núñez, and Rolf Dollevoet. Physics-informed neu- ral networks for solving forward and inverse problems in complex beam systems.IEEE Transactions on Neural Networks and Learning Systems, 35(5):5981–5995, 2023. doi: 10.1109/TNNLS.2023.3310585

  10. [10]

    Challenges in training PINNs: A loss landscape perspective

    Pratik Rathore, Weimu Lei, Zachary Frangella, Lu Lu, and Madeleine Udell. Challenges in training PINNs: A loss landscape perspective. InInternational Conference on Machine Learning, pages 42384–42409. PMLR, 2024. URL https://proceedings.mlr.press/ v235/rathore24a.html

  11. [11]

    Respecting causality for training physics- informed neural networks.Computer Methods in Applied Mechanics and Engineering, 421: 116813, 2024

    Sifan Wang, Shyam Sankaran, and Paris Perdikaris. Respecting causality for training physics- informed neural networks.Computer Methods in Applied Mechanics and Engineering, 421: 116813, 2024. doi: 10.1016/j.cma.2024.116813

  12. [12]

    Wight and Jia Zhao

    Colby L. Wight and Jia Zhao. Solving Allen-Cahn and Cahn-Hilliard equations using the adaptive physics informed neural networks.Communications in Computational Physics, 29(3): 930–954, 2021. doi: 10.4208/cicp.OA-2020-0086

  13. [13]

    Revanth Mattey and Susanta Ghosh. A novel sequential method to train physics informed neural networks for Allen-Cahn and Cahn-Hilliard equations.Computer Methods in Applied Mechanics and Engineering, 390:114474, 2022. doi: 10.1016/j.cma.2021.114474

  14. [14]

    Jagtap, Shandian Zhe, George Em Karniadakis, and Robert M

    Michael Penwarden, Ameya D. Jagtap, Shandian Zhe, George Em Karniadakis, and Robert M. Kirby. A unified scalable framework for causal sweeping strategies for physics-informed neural networks (PINNs) and their temporal decompositions.Journal of Computational Physics, 493: 112464, 2023. doi: 10.1016/j.jcp.2023.112464. 10

  15. [15]

    PPINN: Parareal physics- informed neural network for time-dependent PDEs.Computer Methods in Applied Mechanics and Engineering, 370:113250, 2020

    Xuhui Meng, Zhen Li, Dongkun Zhang, and George Em Karniadakis. PPINN: Parareal physics- informed neural network for time-dependent PDEs.Computer Methods in Applied Mechanics and Engineering, 370:113250, 2020. doi: 10.1016/j.cma.2020.113250

  16. [16]

    Aditya Prakash

    Zhiyuan Zhao, Xueying Ding, and B. Aditya Prakash. PINNsFormer: A transformer-based framework for physics-informed neural networks. InThe Twelfth International Conference on Learning Representations, 2024. URL https://openreview.net/forum?id=DO2WFXU1Be

  17. [17]

    Sub-sequential physics-informed learning with state space model

    Chenhui Xu, Dancheng Liu, Yuting Hu, Jiajie Li, Ruiyang Qin, Qingxiao Zheng, and Jinjun Xiong. Sub-sequential physics-informed learning with state space model. InInternational Conference on Machine Learning, 2025. URL https://icml.cc/virtual/2025/poster/ 45079

  18. [18]

    ML-PINN: A memory-efficient physics- informed Mamba-LSTM network for fast and accurate PDE solving.Neurocomputing, page 131446, 2025

    YiMing Gao, Bing Wang, Jingyi Lu, and Zhou Tian. ML-PINN: A memory-efficient physics- informed Mamba-LSTM network for fast and accurate PDE solving.Neurocomputing, page 131446, 2025. doi: 10.1016/j.neucom.2025.131446

  19. [19]

    Moreira, Márcio Marques, Leonardo Mendonça, Christian Júnior de Oliveira, Vitor Balestro, Lucas dos Santos Fernandez, Daniel Yukimura, Pavel Petrov, João M

    Arthur Bizzi, Leonardo M. Moreira, Márcio Marques, Leonardo Mendonça, Christian Júnior de Oliveira, Vitor Balestro, Lucas dos Santos Fernandez, Daniel Yukimura, Pavel Petrov, João M. Pereira, et al. Neuro-spectral architectures for causal physics-informed networks. InAdvances in Neural Information Processing Systems, volume 38, 2025. URL https: //neurips....

  20. [20]

    Konstantin Rusch and Daniela Rus

    T. Konstantin Rusch and Daniela Rus. Oscillatory state-space models. InThe Thirteenth International Conference on Learning Representations, 2025. URL https://openreview. net/forum?id=GRMfXcAAFh

  21. [21]

    Blelloch

    Guy E. Blelloch. Prefix sums and their applications. Technical Report CMU-CS-90-190, Carnegie Mellon University, School of Computer Science, 1990

  22. [22]

    Improvements of the daily optimum interpolation sea surface temperature (DOISST) version 2.1.Journal of Climate, 34(8):2923–2939, 2021

    Boyin Huang, Chunying Liu, Viva Banzon, Eric Freeman, Garrett Graham, Bill Hankins, Tom Smith, and Huai-Min Zhang. Improvements of the daily optimum interpolation sea surface temperature (DOISST) version 2.1.Journal of Climate, 34(8):2923–2939, 2021. doi: 10.1175/JCLI-D-20-0166.1

  23. [23]

    Gradient-enhanced physics- informed neural networks for forward and inverse PDE problems.Computer Methods in Applied Mechanics and Engineering, 393:114823, 2022

    Jeremy Yu, Lu Lu, Xuhui Meng, and George Em Karniadakis. Gradient-enhanced physics- informed neural networks for forward and inverse PDE problems.Computer Methods in Applied Mechanics and Engineering, 393:114823, 2022. doi: 10.1016/j.cma.2022.114823

  24. [24]

    Seidman, and Paris Perdikaris

    Sifan Wang, Hanwen Wang, Jacob H. Seidman, and Paris Perdikaris. Random weight factorization improves the training of continuous neural representations.arXiv preprint arXiv:2210.01274, 2022. URLhttps://arxiv.org/abs/2210.01274

  25. [25]

    Urbán, Jérôme Darbon, and George Em Karniadakis

    Elham Kiyani, Khemraj Shukla, Jorge F. Urbán, Jérôme Darbon, and George Em Karniadakis. Optimizing the optimizer for physics-informed neural networks and Kolmogorov–Arnold networks.Computer Methods in Applied Mechanics and Engineering, 446:118308, 2025. doi: 10.1016/j.cma.2025.118308

  26. [26]

    Lagaris, A

    Isaac E. Lagaris, Aristidis Likas, and Dimitrios I. Fotiadis. Artificial neural networks for solving ordinary and partial differential equations.IEEE Transactions on Neural Networks, 9 (5):987–1000, 1998. doi: 10.1109/72.712178

  27. [27]

    A unified deep artificial neural network approach to partial differential equations in complex geometries.Neurocomputing, 317:28–41, 2018

    Jens Berg and Kaj Nyström. A unified deep artificial neural network approach to partial differential equations in complex geometries.Neurocomputing, 317:28–41, 2018. doi: 10.1016/ j.neucom.2018.06.056

  28. [28]

    DeepXDE: A deep learning library for solving differential equations.SIAM Review, 63(1):208–228, 2021

    Lu Lu, Xuhui Meng, Zhiping Mao, and George Em Karniadakis. DeepXDE: A deep learning library for solving differential equations.SIAM Review, 63(1):208–228, 2021. doi: 10.1137/ 19M1274067

  29. [29]

    Chenxi Wu, Min Zhu, Qinyang Tan, Yadhu Kartha, and Lu Lu. A comprehensive study of non- adaptive and residual-based adaptive sampling for physics-informed neural networks.Computer Methods in Applied Mechanics and Engineering, 403:115671, 2023. doi: 10.1016/j.cma.2022. 115671. 11

  30. [30]

    Mitigating propagation failures in physics-informed neural networks using retain-resample-release (R3) sampling

    Arka Daw, Jie Bu, Sifan Wang, Paris Perdikaris, and Anuj Karpatne. Mitigating propagation failures in physics-informed neural networks using retain-resample-release (R3) sampling. InInternational Conference on Machine Learning, pages 7264–7302. PMLR, 2023. URL https://proceedings.mlr.press/v202/daw23a.html

  31. [31]

    Failure-informed adaptive sampling for PINNs.SIAM Journal on Scientific Computing, 45(4):A1971–A1994, 2023

    Zhiping Gao, Liang Yan, and Tao Zhou. Failure-informed adaptive sampling for PINNs.SIAM Journal on Scientific Computing, 45(4):A1971–A1994, 2023. doi: 10.1137/22M1527763

  32. [32]

    Yifan Du and Tamer A. Zaki. Evolutional deep neural network.Physical Review E, 104(4): 045303, 2021. doi: 10.1103/PhysRevE.104.045303

  33. [33]

    Mamba: Linear-time sequence modeling with selective state spaces

    Albert Gu and Tri Dao. Mamba: Linear-time sequence modeling with selective state spaces. InFirst Conference on Language Modeling, 2024. URL https://openreview.net/forum? id=tEYskw1VY2

  34. [34]

    Efficiently modeling long sequences with structured state spaces

    Albert Gu, Karan Goel, and Christopher Ré. Efficiently modeling long sequences with structured state spaces. InThe Tenth International Conference on Learning Representations, 2022. URL https://openreview.net/forum?id=uYLFoz1vlAC

  35. [35]

    Jimmy T. H. Smith, Andrew Warrington, and Scott W. Linderman. Simplified state space layers for sequence modeling. InThe Eleventh International Conference on Learning Representations,

  36. [36]

    URLhttps://openreview.net/forum?id=Ai8Hw3AXqks

  37. [37]

    Smith, Albert Gu, Anushan Fernando, Caglar Gulcehre, Razvan Pascanu, and Soham De

    Antonio Orvieto, Samuel L. Smith, Albert Gu, Anushan Fernando, Caglar Gulcehre, Razvan Pascanu, and Soham De. Resurrecting recurrent neural networks for long sequences. In International Conference on Machine Learning, pages 26670–26698. PMLR, 2023. URL https://proceedings.mlr.press/v202/orvieto23a.html

  38. [38]

    Fourier neural operator for parametric partial dif- ferential equations

    Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, and Anima Anandkumar. Fourier neural operator for parametric partial dif- ferential equations. InInternational Conference on Learning Representations, 2021. URL https://openreview.net/forum?id=c8P9NQVtmnO

  39. [39]

    doi:10.1038/s42256-021-00302-5 Lu Lu, Raphaël Pestourie, Steven G

    Lu Lu, Pengzhan Jin, Guofei Pang, Zhongqiang Zhang, and George Em Karniadakis. Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators. Nature Machine Intelligence, 3(3):218–229, 2021. doi: 10.1038/s42256-021-00302-5

  40. [40]

    Neural Operator: Graph Kernel Network for Partial Differential Equations

    Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, and Anima Anandkumar. Neural operator: Graph kernel network for partial differential equations.arXiv preprint arXiv:2003.03485, 2020. URL https://arxiv.org/ abs/2003.03485

  41. [41]

    Learning the solution operator of parametric partial differential equations with physics-informed DeepONets.Science Advances, 7(40): eabi8605, 2021

    Sifan Wang, Hanwen Wang, and Paris Perdikaris. Learning the solution operator of parametric partial differential equations with physics-informed DeepONets.Science Advances, 7(40): eabi8605, 2021. doi: 10.1126/sciadv.abi8605

  42. [42]

    Fanaskov and Ivan V

    Vladimir S. Fanaskov and Ivan V . Oseledets. Spectral neural operators.Doklady Mathematics, 108(Suppl 2):S226–S232, 2023. doi: 10.1134/S1064562423701107

  43. [43]

    Jagtap, Ehsan Kharazmi, and George Em Karniadakis

    Ameya D. Jagtap, Ehsan Kharazmi, and George Em Karniadakis. Conservative physics-informed neural networks on discrete domains for conservation laws: Applications to forward and inverse problems.Computer Methods in Applied Mechanics and Engineering, 365:113028, 2020. doi: 10.1016/j.cma.2020.113028

  44. [44]

    Jagtap and George Em Karniadakis

    Ameya D. Jagtap and George Em Karniadakis. Extended physics-informed neural networks (XPINNs): A generalized space-time domain decomposition based deep learning framework for nonlinear partial differential equations.Communications in Computational Physics, 28(5): 2002–2041, 2020. doi: 10.4208/cicp.OA-2020-0164

  45. [45]

    doi:10.1016/j

    Sifan Wang, Hanwen Wang, and Paris Perdikaris. On the eigenvector bias of Fourier feature networks: From regression to solving multi-scale PDEs with physics-informed neural networks. Computer Methods in Applied Mechanics and Engineering, 384:113938, 2021. doi: 10.1016/j. cma.2021.113938. 12

  46. [46]

    Neural controlled differential equations for irregular time series.Advances in neural information processing systems, 33: 6696–6707, 2020

    Patrick Kidger, James Morrill, James Foster, and Terry Lyons. Neural controlled differential equations for irregular time series.Advances in neural information processing systems, 33: 6696–6707, 2020

  47. [47]

    Neural rough differential equations for long time series

    James Morrill, Cristopher Salvi, Patrick Kidger, and James Foster. Neural rough differential equations for long time series. InInternational Conference on Machine Learning, pages 7829–7838. PMLR, 2021

  48. [48]

    Log neural controlled differential equations: The lie brackets make a difference.arXiv preprint arXiv:2402.18512, 2024

    Benjamin Walker, Andrew D McLeod, Tiexin Qin, Yichuan Cheng, Haoliang Li, and Terry Lyons. Log neural controlled differential equations: The lie brackets make a difference.arXiv preprint arXiv:2402.18512, 2024. 13 Supplementary Material Contents §A Extended Related Work p. 14 §B OSSM-PINN Pseudocode p. 16 §C Experiment Overview and Benchmark Suite p. 16 §...

  49. [49]

    Wang et al

    traced failures to imbalanced gradient flow between loss terms. Wang et al. [8] provided a neural tangent kernel analysis revealing spectral bias toward low frequencies, and Rathore et al

  50. [50]

    These analyses motivate two complementary directions: improving the training procedure (loss weighting, optimizers, sampling) and improving the architecture

    showed that PINN loss surfaces contain narrow valleys with ill-conditioned curvature. These analyses motivate two complementary directions: improving the training procedure (loss weighting, optimizers, sampling) and improving the architecture. A.2 Training Improvements: Weighting, Optimization, and Sampling Adaptive loss weighting [7] adjusts the balance ...

  51. [51]

    Using the Hermite basis reduces QHO rMAE from 9.5×10 −3 to 1.9×10 −4, a 50× improvement at no architectural cost (Figure 29). I.0.2 Nonlinear Schrödinger Equation We consider the cubic NLS iψt + 1 2 ψxx +|ψ| 2ψ= 0 on (x, t)∈[−5,5]×[0, π/2] with periodic boundary conditions (ψandψ x matched atx=±5) and the soliton-like initial condition ψ(x,0) = 2 sech(x),...