Oscillatory State-Space Models as Inductive Biases for Physics-Informed Neural PDE Solvers

Abhishek Chandra; Taniya Kapoor

arXiv: 2606.02623 · v1 · pith:4WDZTI7Enew · submitted 2026-05-29 · 💻 cs.NE · cs.AI · cs.LG

Pith reviewed 2026-06-28 20:08 UTC · model grok-4.3

classification 💻 cs.NE · cs.AI · cs.LG

keywords physics-informed neural networks · state-space models · oscillatory dynamics · PDE solvers · spectral methods · inductive biases · time-dependent PDEs · high-dimensional problems

The pith

Oscillatory state-space models for temporal evolution in PINNs enable closed-form spatial differentiation and consistent boundary conditions while improving accuracy and cutting memory versus sequence models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a physics-informed neural network that replaces general sequence models with linear-oscillator state-space dynamics to capture the time evolution of PDE solutions. This temporal component is paired with a PDE-aware spectral basis in space. The combination permits closed-form differentiation in space and straightforward enforcement of boundary conditions. Evaluations on forward, inverse, and high-dimensional PDE problems, including cases with up to 100 spatial dimensions, show gains in accuracy alongside lower memory use than recent sequence-model PINN baselines. The work therefore argues that structured dynamical priors aligned with PDE modal structure can make neural solvers both more accurate and more scalable.

Core claim

A PINN architecture that uses linear-oscillator-based state-space dynamics for temporal evolution together with a PDE-aware spectral basis in space achieves closed-form spatial differentiation, consistent boundary-condition enforcement, higher accuracy, and lower memory consumption than sequence-model-based PINN approaches when applied to forward, inverse, and high-dimensional time-dependent PDE problems up to 100 spatial dimensions.

What carries the argument

Linear-oscillator state-space model for temporal evolution combined with PDE-aware spectral basis for spatial representation, which together supply the structured inductive bias.
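
To make the temporal half of this pairing concrete, here is a minimal sketch of what rolling out a bank of linear oscillators looks like. It is not the paper's LinOSS layer: the function name `rollout_oscillators`, the fixed frequencies in `omega`, the step size, and the choice of a semi-implicit Euler update are all assumptions made for illustration, and nothing here is learned.

```python
import numpy as np

def rollout_oscillators(omega, y0, z0, dt, n_steps):
    """Roll out decoupled oscillators y'' = -omega**2 * y.

    Uses a semi-implicit (symplectic) Euler step; this is an illustrative
    choice, not necessarily the discretization used in the paper.
    """
    omega = np.asarray(omega, dtype=float)
    y = np.array(y0, dtype=float)   # positions (latent states)
    z = np.array(z0, dtype=float)   # velocities
    traj = np.empty((n_steps + 1, y.size))
    traj[0] = y
    for n in range(n_steps):
        z = z - dt * omega**2 * y   # velocity update uses the current position
        y = y + dt * z              # position update uses the new velocity
        traj[n + 1] = y
    return traj

# Hypothetical setup: three modes with frequencies 1, 2, 3, unit initial displacement.
omega = np.array([1.0, 2.0, 3.0])
dt, n_steps = 1e-3, 1000
traj = rollout_oscillators(omega, np.ones(3), np.zeros(3), dt, n_steps)
exact = np.cos(omega * dt * n_steps)        # closed-form solution at the final time
max_err = np.max(np.abs(traj[-1] - exact))  # small for this step size
```

Each column of `traj` plays the role of one modal coefficient over time. In the architecture described above, such latent trajectories would be decoded and combined with the spatial basis rather than used directly.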

If this is right

Closed-form spatial differentiation becomes available without numerical approximation.
Boundary conditions can be enforced consistently across the domain.
Accuracy improves on both forward and inverse PDE problems relative to sequence-model baselines.
Memory requirements scale more favorably with sequence length and resolution.
The method remains applicable to problems with up to 100 spatial dimensions.
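
The first two points can be illustrated with a small example, under the assumption of a sine basis on [0, π] with homogeneous Dirichlet boundary conditions. The page does not say which basis the paper uses for any given benchmark, so the basis, the helper names `sine_series` and `sine_series_dxx`, and the coefficient values are hypothetical.

```python
import numpy as np

def sine_series(coeffs, x):
    """u(x) = sum_k a_k sin(k x), k = 1..K, on [0, pi]; vanishes at x = 0 and x = pi."""
    k = np.arange(1, len(coeffs) + 1)
    return np.sin(np.outer(x, k)) @ coeffs

def sine_series_dxx(coeffs, x):
    """Exact second derivative: each mode is scaled by -k**2, no autodiff or stencil."""
    k = np.arange(1, len(coeffs) + 1)
    return np.sin(np.outer(x, k)) @ (-(k**2) * coeffs)

coeffs = np.array([1.0, 0.5, -0.25])   # hypothetical modal coefficients at one time
x = np.linspace(0.0, np.pi, 201)
u = sine_series(coeffs, x)
u_xx = sine_series_dxx(coeffs, x)

# Cross-check against a central finite difference on the interior points.
h = x[1] - x[0]
fd = (u[2:] - 2.0 * u[1:-1] + u[:-2]) / h**2
fd_gap = np.max(np.abs(fd - u_xx[1:-1]))
```

Because every basis function already vanishes at both endpoints, the boundary condition holds for any coefficients, and the derivative is obtained by rescaling coefficients instead of differentiating through a network.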

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the authors make directly.

The same oscillator prior could be tested on time-dependent systems outside the PDE setting, such as ODE networks or control problems.
Replacing the linear oscillator with a nonlinear state-space variant might extend the approach to problems with stronger nonlinear temporal dynamics.
Lower memory footprints could support longer-time or ensemble simulations that current sequence models cannot reach.
The spectral spatial basis might combine with other temporal priors, such as Hamiltonian or symplectic structures, to create further physics-aligned architectures.

Load-bearing premise

The temporal evolution of the target PDE solutions can be represented by linear-oscillator state-space dynamics without substantial loss of fidelity.

What would settle it

A time-dependent PDE whose solution exhibits strongly nonlinear or chaotic temporal behavior where the oscillatory state-space model produces lower accuracy or higher memory use than a comparable sequence-model PINN baseline.

Figures

Figures reproduced from arXiv: 2606.02623 by Abhishek Chandra, Taniya Kapoor.

**Figure 2.** OSSM-PINN architecture: The initial condition is encoded into an oscillatory LinOSS latent state, whose temporal rollout produces modal coefficients through an MLP decoder. Coefficients are combined with spatial basis and boundary factor to form the spatio-temporal solution, which is trained through physics-informed loss. This section introduces OSSM-PINNs for solving time-dependent PDEs. Let Ω ⊂ R^d be…

**Figure 3.** Convection (β = 50): predicted u(x, t) fields and absolute errors for each method. [Panels: reference u(x, t), predicted u(x, t), and absolute error, plotted over x and t.]

**Figure 4.** Reaction: predicted u(x, t) fields and absolute errors for each method. …benchmarks, with the largest gains on high-frequency convection, wave propagation, and Euler–Bernoulli beam problems. These results indicate that the oscillator–spectral factorization is effective across both first- and second-order-in-time PDEs, as well as high-order spatial operators. Figures 3 and 4, together with SM §E Figures 12,…

**Figure 5.** Frequency-domain comparison on the wave equation: predictions and Fourier spectra. [Plot labels: angular frequency ω; dominant ω for conv β = 50, conv β = 100, wave, and EB; series analytical, OSSM-IM (FFT), OSSM-IMEX (FFT).]

**Figure 7.** SST inverse at t = 23 mo: reference (top), OSSM-IM prediction (mid), absolute error (bot). Full results in…

**Figure 8.** Schrödinger 100D: |ψ(x1, …, x100, t)|² at t = π/2 showing a slice in (x1, x2, x3). Reference (left), OSSM-IM prediction (center), absolute error (right). [Panel labels: L log Lip = 37.9 (PINNsFormer), 16.7 (PINNMamba), 13.2 (OSSM-PINN-IMEX).]

**Figure 10.** Forward-problem benchmark overview: key challenge and ground-truth solution for each…

**Figure 11.** Benchmark overview (inverse, geometry, high-dimensional, problem-adapted-basis).

**Figure 12.** Wave equation: predicted u(x, t) fields and absolute errors for each method. [Panels: reference u(x, t), predicted u(x, t), and absolute error, plotted over x and t.]

**Figure 13.** Euler–Bernoulli beam (extended domain [0, 8π]): predicted u(x, t) fields and absolute errors for each method. [Panels: reference u(x, t), predicted u(x, t), and absolute error, plotted over x and t.]

**Figure 14.** Convection (β = 100): predicted u(x, t) fields and absolute errors for each method.

**Figure 15.** Euler–Bernoulli beam (classical, [0, 2π]): predicted u(x, t) fields and absolute errors for each method.

**Figure 16.** Latent dynamics on all benchmarks in addition to problems shown in Figure 1. Columns:…

**Figure 17.** Frequency-domain comparison on convection (…

**Figure 18.** Frequency-domain comparison on convection (…

**Figure 19.** Frequency-domain comparison on reaction at the final time: spatial Fourier magnitude…

**Figure 20.** Frequency-domain comparison on Euler–Bernoulli beam (classical,…

**Figure 21.** Frequency-domain comparison on extended Euler–Bernoulli at the final time: spatial…

**Figure 22.** Loss-landscape on convection (β = 100) along top-2 Hessian eigenvectors. Smaller L log Lip indicates a smoother basin.

**Figure 23.** KdV inverse problem: predicted u(x, t) fields and absolute errors. OSSM-PINN recovers both coefficients (λ1, λ2) to < 0.1% relative error. [Panel labels: Reference, OSSM-IM and OSSM-IMEX predictions and |err| at t = 6, 12, 18, and 23 mo; color bars T (°C) and |err| (°C).]

**Figure 24.** SST 2D advection–diffusion inverse problem: predicted temperature…

**Figure 25.** KdV coefficient convergence during training. Dashed line: true parameter value.

**Figure 26.** SST coefficient convergence during training (…

**Figure 27.** Frequency-domain validation of KdV inverse recovery at final time: spatial Fourier…

**Figure 28.** Problem-adapted basis functions. [Panels: reference u(x, t), predicted u(x, t), and absolute error, plotted over x and t.]

**Figure 29.** QHO 1D: predicted fields with Fourier vs. Hermite basis.

**Figure 30.** Pöschl–Teller well: predicted fields with Fourier vs. PT basis.

**Figure 31.** Taylor–Green vortex 2D (Re = 100): predicted velocity components u(x, y), v(x, y) and pressure p(x, y), with absolute errors. [Panels: Pred and |err| over x and y at t = 0.1, 0.5, and 0.9.]

**Figure 32.** Heat equation on a triangular domain: predicted…

**Figure 33.** Schrödinger 5D: |ψ(x1, …, x5, t)|² at t = π/2, showing a slice in (x1, x2, x3). Reference, OSSM-IM prediction, and absolute error. [Panels: Reference, OSSM-IM Pred, OSSM-IM |err|, OSSM-IMEX Pred, OSSM-IMEX |err|.] Schrödinger 100D: |ψ|² on three faces of the inner cube [π/8, 7π/8]³ ⊂ [0, π]³ at t = 1.57; remaining 97…

**Figure 34.** Schrödinger 100D: |ψ(x1, …, x100, t)|² at t = π/2, showing a slice in (x1, x2, x3). Reference, OSSM-IM prediction, and absolute error.

**Figure 35.** Training-loss histories for OSSM-PINN-IMEX (three seeds).

**Figure 36.** Training-loss histories for OSSM-PINN-IM (three seeds).

**Figure 37.** Seed stability: mean rMAE ±1 std over three seeds for seven benchmarks.

Original abstract

Solving time-dependent partial differential equations (PDEs) is an important problem in computational science and engineering. Physics-informed neural networks (PINNs) learn PDE solutions from governing equations. However, accurately capturing temporal evolution remains challenging. Recent sequence-model-based approaches parameterize time evolution using general-purpose sequence models, which capture temporal dependencies but do not explicitly encode the structured dynamics of PDE solutions. In addition, their memory requirements can scale unfavorably with sequence length and resolution, limiting applicability in large-scale or high-dimensional settings. This work introduces a PINN approach that incorporates oscillatory state-space dynamics to represent the modal structure of PDE solutions. The proposed method leverages a linear-oscillator-based temporal evolution, together with a PDE-aware spectral basis in space. This design enables closed-form spatial differentiation and consistent enforcement of boundary conditions. The method is evaluated on forward, inverse, and high-dimensional PDE problems, including cases up to 100 spatial dimensions. The results show improved accuracy and reduced memory usage compared to recent sequence-model-based PINN approaches. Overall, this work highlights the benefits of incorporating structured dynamical priors into the temporal evolution of neural PDE solvers and suggests designing more physics-aligned and computationally efficient PINN architectures.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

The paper pairs linear oscillatory state-space models for time with PDE-aware spectral bases for space in PINNs, claiming better accuracy and lower memory than sequence-model baselines, but the temporal prior looks too narrow for many PDE regimes.

The main thing here is a new architectural choice: instead of generic sequence models for the time dimension in PINNs, they use a linear-oscillator state-space model that is meant to capture modal temporal behavior, paired with a spectral spatial basis that allows exact differentiation and boundary handling. That combination is not just a routine swap; it directly targets the memory scaling problem that sequence models hit on long horizons or high dimensions.

What works is the explicit physics alignment on the spatial side. Closed-form derivatives and consistent boundaries are real advantages over black-box sequence approaches, and the abstract reports gains on forward, inverse, and up to 100-dimensional cases. If the experiments hold up with proper controls, this could be useful for problems where the solution really does behave like a set of linear oscillators.

The soft spot is exactly the one the stress test flags. Linear oscillators do not naturally produce damping, exponential decay, or chaotic evolution, and the abstract gives no sign of how the model is adjusted or regularized when the PDE is parabolic or strongly nonlinear. If the test problems were chosen to fit the oscillator assumption, the accuracy claims will not generalize. Without seeing the actual equations, training details, or ablation on non-oscillatory cases, it is hard to tell whether the reported improvements are robust or setup-dependent.
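
A standard modal calculation shows what is at stake in that distinction. It is a textbook illustration, not something taken from the paper; the sine basis on [0, π], the wave speed c, and the diffusivity ν are assumptions chosen for the example. Substituting the same spatial expansion into a wave equation and into a heat equation gives two different ODEs for the modal coefficient a_k(t):

```latex
\begin{align*}
u(x,t) &= \sum_{k \ge 1} a_k(t)\,\sin(kx), \\
u_{tt} = c^2 u_{xx}
  &\;\Longrightarrow\; \ddot a_k = -(ck)^2\, a_k
  \;\Longrightarrow\; a_k(t) = a_k(0)\cos(ckt) + \frac{\dot a_k(0)}{ck}\,\sin(ckt), \\
u_t = \nu\, u_{xx}
  &\;\Longrightarrow\; \dot a_k = -\nu k^2\, a_k
  \;\Longrightarrow\; a_k(t) = a_k(0)\, e^{-\nu k^2 t}.
\end{align*}
```

In the wave case each mode is exactly an undamped oscillator with frequency ck, so an oscillator prior matches the dynamics. In the heat case each mode decays monotonically without oscillating, which is the kind of regime where the question raised here would need an answer from the experiments.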

This is for people already working on sequence-model PINNs or structured priors for time-dependent problems. A reader who needs a drop-in replacement for long-horizon or high-dimensional PDEs might get value if the experiments are solid; otherwise it is mostly an idea paper. The work shows clear thinking about inductive biases, so it deserves a serious referee to check the implementation and the scope of the temporal model.

Referee Report

2 major / 0 minor

Summary. The manuscript proposes a physics-informed neural network (PINN) architecture that uses oscillatory state-space models to parameterize temporal evolution of PDE solutions, paired with a PDE-aware spectral basis for spatial discretization. This enables closed-form spatial derivatives and boundary condition enforcement. The approach is evaluated on forward, inverse, and high-dimensional (up to 100D) PDE problems and is reported to outperform recent sequence-model-based PINNs in accuracy while using less memory.

Significance. If the empirical gains hold under rigorous verification, the work would demonstrate the value of embedding structured linear dynamical priors into PINN temporal modules, offering a route to scalable solvers for high-dimensional time-dependent PDEs where general sequence models become memory-intensive.

major comments (2)

[Abstract] The central claim that linear-oscillator state-space dynamics represent the modal temporal evolution 'without significant loss of fidelity' for the target PDEs is load-bearing, yet the abstract provides no indication of how the model is modified or regularized when the underlying PDE exhibits damping, exponential decay, or strong nonlinearity (e.g., parabolic or chaotic regimes).
[Abstract] The reported accuracy and memory improvements are presented without reference to specific PDE families tested, sequence lengths, spatial resolutions, or quantitative baselines (error bars, number of runs), making it impossible to assess whether the gains are robust or confined to pre-selected oscillatory problems.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed comments on the abstract. We respond point by point below and will revise the abstract for clarity where appropriate.

Referee: [Abstract] The central claim that linear-oscillator state-space dynamics represent the modal temporal evolution 'without significant loss of fidelity' for the target PDEs is load-bearing, yet the abstract provides no indication of how the model is modified or regularized when the underlying PDE exhibits damping, exponential decay, or strong nonlinearity (e.g., parabolic or chaotic regimes).

Authors: The manuscript positions the oscillatory state-space model as an inductive bias specifically for PDEs whose solutions exhibit modal oscillatory structure (see Introduction and Section 3). No explicit modification or regularization for damping/strong nonlinearity is introduced because the target problems are those where the linear oscillator prior aligns with the physics; applicability outside this regime is discussed as a limitation in the conclusion. We agree the abstract should better delimit scope and will revise it to state that the approach targets oscillatory modal evolution. Revision: yes.

Referee: [Abstract] The reported accuracy and memory improvements are presented without reference to specific PDE families tested, sequence lengths, spatial resolutions, or quantitative baselines (error bars, number of runs), making it impossible to assess whether the gains are robust or confined to pre-selected oscillatory problems.

Authors: The abstract is intentionally high-level; concrete PDE families (wave, Schrödinger, etc.), sequence lengths, resolutions, and quantitative results (including error bars over multiple runs) appear in Section 4 and the associated tables/figures. We acknowledge that the abstract could better signal the breadth of evaluation and will revise it to name example PDE families and note that results include statistical quantification over repeated trials. Revision: yes.

Circularity Check

0 steps flagged

No circularity: empirical method proposal with independent evaluation

The paper introduces an oscillatory state-space model as an inductive bias for PINNs, combined with a spectral spatial basis. The abstract and description frame this as an architectural choice evaluated empirically on forward/inverse/high-dimensional PDE tasks, with reported gains in accuracy and memory. No equations, fitting procedures, or derivation steps are presented that reduce a claimed prediction or result to a fitted input, self-definition, or self-citation chain. The central claim remains an empirical improvement over sequence-model baselines rather than a self-referential derivation. This is a standard self-contained proposal; no load-bearing step collapses by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract supplies no explicit free parameters, axioms, or invented entities; the central claim rests on the unstated premise that linear oscillators capture the modal structure of the target PDEs.


2021
[39]

doi:10.1038/s42256-021-00302-5 Lu Lu, Raphaël Pestourie, Steven G

Lu Lu, Pengzhan Jin, Guofei Pang, Zhongqiang Zhang, and George Em Karniadakis. Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators. Nature Machine Intelligence, 3(3):218–229, 2021. doi: 10.1038/s42256-021-00302-5

work page doi:10.1038/s42256-021-00302-5 2021
[40]

Neural Operator: Graph Kernel Network for Partial Differential Equations

Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, and Anima Anandkumar. Neural operator: Graph kernel network for partial differential equations.arXiv preprint arXiv:2003.03485, 2020. URL https://arxiv.org/ abs/2003.03485

work page internal anchor Pith review Pith/arXiv arXiv 2003
[41]

Learning the solution operator of parametric partial differential equations with physics-informed DeepONets.Science Advances, 7(40): eabi8605, 2021

Sifan Wang, Hanwen Wang, and Paris Perdikaris. Learning the solution operator of parametric partial differential equations with physics-informed DeepONets.Science Advances, 7(40): eabi8605, 2021. doi: 10.1126/sciadv.abi8605

work page doi:10.1126/sciadv.abi8605 2021
[42]

Fanaskov and Ivan V

Vladimir S. Fanaskov and Ivan V . Oseledets. Spectral neural operators.Doklady Mathematics, 108(Suppl 2):S226–S232, 2023. doi: 10.1134/S1064562423701107

work page doi:10.1134/s1064562423701107 2023
[43]

Jagtap, Ehsan Kharazmi, and George Em Karniadakis

Ameya D. Jagtap, Ehsan Kharazmi, and George Em Karniadakis. Conservative physics-informed neural networks on discrete domains for conservation laws: Applications to forward and inverse problems.Computer Methods in Applied Mechanics and Engineering, 365:113028, 2020. doi: 10.1016/j.cma.2020.113028

work page doi:10.1016/j.cma.2020.113028 2020
[44]

Jagtap and George Em Karniadakis

Ameya D. Jagtap and George Em Karniadakis. Extended physics-informed neural networks (XPINNs): A generalized space-time domain decomposition based deep learning framework for nonlinear partial differential equations.Communications in Computational Physics, 28(5): 2002–2041, 2020. doi: 10.4208/cicp.OA-2020-0164

work page doi:10.4208/cicp.oa-2020-0164 2002
[45]

doi:10.1016/j

Sifan Wang, Hanwen Wang, and Paris Perdikaris. On the eigenvector bias of Fourier feature networks: From regression to solving multi-scale PDEs with physics-informed neural networks. Computer Methods in Applied Mechanics and Engineering, 384:113938, 2021. doi: 10.1016/j. cma.2021.113938. 12

work page doi:10.1016/j 2021
[46]

Neural controlled differential equations for irregular time series.Advances in neural information processing systems, 33: 6696–6707, 2020

Patrick Kidger, James Morrill, James Foster, and Terry Lyons. Neural controlled differential equations for irregular time series.Advances in neural information processing systems, 33: 6696–6707, 2020

2020
[47]

Neural rough differential equations for long time series

James Morrill, Cristopher Salvi, Patrick Kidger, and James Foster. Neural rough differential equations for long time series. InInternational Conference on Machine Learning, pages 7829–7838. PMLR, 2021

2021
[48]

Log neural controlled differential equations: The lie brackets make a difference.arXiv preprint arXiv:2402.18512, 2024

Benjamin Walker, Andrew D McLeod, Tiexin Qin, Yichuan Cheng, Haoliang Li, and Terry Lyons. Log neural controlled differential equations: The lie brackets make a difference.arXiv preprint arXiv:2402.18512, 2024. 13 Supplementary Material Contents §A Extended Related Work p. 14 §B OSSM-PINN Pseudocode p. 16 §C Experiment Overview and Benchmark Suite p. 16 §...

work page arXiv 2024
[49]

Wang et al

traced failures to imbalanced gradient flow between loss terms. Wang et al. [8] provided a neural tangent kernel analysis revealing spectral bias toward low frequencies, and Rathore et al
[50]

These analyses motivate two complementary directions: improving the training procedure (loss weighting, optimizers, sampling) and improving the architecture

showed that PINN loss surfaces contain narrow valleys with ill-conditioned curvature. These analyses motivate two complementary directions: improving the training procedure (loss weighting, optimizers, sampling) and improving the architecture. A.2 Training Improvements: Weighting, Optimization, and Sampling Adaptive loss weighting [7] adjusts the balance ...

2008
[51]

Using the Hermite basis reduces QHO rMAE from 9.5×10 −3 to 1.9×10 −4, a 50× improvement at no architectural cost (Figure 29). I.0.2 Nonlinear Schrödinger Equation We consider the cubic NLS iψt + 1 2 ψxx +|ψ| 2ψ= 0 on (x, t)∈[−5,5]×[0, π/2] with periodic boundary conditions (ψandψ x matched atx=±5) and the soliton-like initial condition ψ(x,0) = 2 sech(x),...

[1] Lawrence C. Evans. Partial differential equations, volume 19 of Graduate Studies in Mathematics. American Mathematical Society, 2nd edition, 2010. doi: 10.1090/gsm/019

[2] Maziar Raissi, Paris Perdikaris, and George Em Karniadakis. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. Journal of Computational Physics, 378:686–707, 2019. doi: 10.1016/j.jcp.2018.10.045

[3] George Em Karniadakis, Ioannis G. Kevrekidis, Lu Lu, Paris Perdikaris, Sifan Wang, and Liu Yang. Physics-informed machine learning. Nature Reviews Physics, 3(6):422–440, 2021. doi: 10.1038/s42254-021-00314-5

[4] Constanza A. Molina Catricheo, Fabrice Lambert, Julien Salomon, and Elwin van’t Wout. Modeling global surface dust deposition using physics-informed neural networks. Communications Earth & Environment, 5(1):778, 2024. doi: 10.1038/s43247-024-01942-2

[5] Benjamin Moseley. Physics-informed machine learning: From concepts to real-world applications. PhD thesis, University of Oxford, 2022

[6] Aditi Krishnapriyan, Amir Gholami, Shandian Zhe, Robert Kirby, and Michael W. Mahoney. Characterizing possible failure modes in physics-informed neural networks. In Advances in Neural Information Processing Systems, volume 34, pages 26548–26560, 2021. URL https://proceedings.neurips.cc/paper/2021/hash/df438e5206f31600e6ae4af72f2725f1-Abstract.html

[7] Sifan Wang, Yujun Teng, and Paris Perdikaris. Understanding and mitigating gradient flow pathologies in physics-informed neural networks. SIAM Journal on Scientific Computing, 43(5):A3055–A3081, 2021. doi: 10.1137/20M1318043

[8] Sifan Wang, Xinling Yu, and Paris Perdikaris. When and why PINNs fail to train: A neural tangent kernel perspective. Journal of Computational Physics, 449:110768, 2022. doi: 10.1016/j.jcp.2021.110768

[9] Taniya Kapoor, Hongrui Wang, Alfredo Núñez, and Rolf Dollevoet. Physics-informed neural networks for solving forward and inverse problems in complex beam systems. IEEE Transactions on Neural Networks and Learning Systems, 35(5):5981–5995, 2023. doi: 10.1109/TNNLS.2023.3310585

[10] Pratik Rathore, Weimu Lei, Zachary Frangella, Lu Lu, and Madeleine Udell. Challenges in training PINNs: A loss landscape perspective. In International Conference on Machine Learning, pages 42384–42409. PMLR, 2024. URL https://proceedings.mlr.press/v235/rathore24a.html

[11] Sifan Wang, Shyam Sankaran, and Paris Perdikaris. Respecting causality for training physics-informed neural networks. Computer Methods in Applied Mechanics and Engineering, 421:116813, 2024. doi: 10.1016/j.cma.2024.116813

[12] Colby L. Wight and Jia Zhao. Solving Allen-Cahn and Cahn-Hilliard equations using the adaptive physics informed neural networks. Communications in Computational Physics, 29(3):930–954, 2021. doi: 10.4208/cicp.OA-2020-0086

[13] Revanth Mattey and Susanta Ghosh. A novel sequential method to train physics informed neural networks for Allen-Cahn and Cahn-Hilliard equations. Computer Methods in Applied Mechanics and Engineering, 390:114474, 2022. doi: 10.1016/j.cma.2021.114474

[14] Michael Penwarden, Ameya D. Jagtap, Shandian Zhe, George Em Karniadakis, and Robert M. Kirby. A unified scalable framework for causal sweeping strategies for physics-informed neural networks (PINNs) and their temporal decompositions. Journal of Computational Physics, 493:112464, 2023. doi: 10.1016/j.jcp.2023.112464

[15] Xuhui Meng, Zhen Li, Dongkun Zhang, and George Em Karniadakis. PPINN: Parareal physics-informed neural network for time-dependent PDEs. Computer Methods in Applied Mechanics and Engineering, 370:113250, 2020. doi: 10.1016/j.cma.2020.113250

[16] Zhiyuan Zhao, Xueying Ding, and B. Aditya Prakash. PINNsFormer: A transformer-based framework for physics-informed neural networks. In The Twelfth International Conference on Learning Representations, 2024. URL https://openreview.net/forum?id=DO2WFXU1Be

[17] Chenhui Xu, Dancheng Liu, Yuting Hu, Jiajie Li, Ruiyang Qin, Qingxiao Zheng, and Jinjun Xiong. Sub-sequential physics-informed learning with state space model. In International Conference on Machine Learning, 2025. URL https://icml.cc/virtual/2025/poster/45079

[18] YiMing Gao, Bing Wang, Jingyi Lu, and Zhou Tian. ML-PINN: A memory-efficient physics-informed Mamba-LSTM network for fast and accurate PDE solving. Neurocomputing, page 131446, 2025. doi: 10.1016/j.neucom.2025.131446

[19] Arthur Bizzi, Leonardo M. Moreira, Márcio Marques, Leonardo Mendonça, Christian Júnior de Oliveira, Vitor Balestro, Lucas dos Santos Fernandez, Daniel Yukimura, Pavel Petrov, João M. Pereira, et al. Neuro-spectral architectures for causal physics-informed networks. In Advances in Neural Information Processing Systems, volume 38, 2025. URL https://neurips....

[20] T. Konstantin Rusch and Daniela Rus. Oscillatory state-space models. In The Thirteenth International Conference on Learning Representations, 2025. URL https://openreview.net/forum?id=GRMfXcAAFh

[21] Guy E. Blelloch. Prefix sums and their applications. Technical Report CMU-CS-90-190, Carnegie Mellon University, School of Computer Science, 1990

[22] Boyin Huang, Chunying Liu, Viva Banzon, Eric Freeman, Garrett Graham, Bill Hankins, Tom Smith, and Huai-Min Zhang. Improvements of the daily optimum interpolation sea surface temperature (DOISST) version 2.1. Journal of Climate, 34(8):2923–2939, 2021. doi: 10.1175/JCLI-D-20-0166.1

[23] Jeremy Yu, Lu Lu, Xuhui Meng, and George Em Karniadakis. Gradient-enhanced physics-informed neural networks for forward and inverse PDE problems. Computer Methods in Applied Mechanics and Engineering, 393:114823, 2022. doi: 10.1016/j.cma.2022.114823

[24] Sifan Wang, Hanwen Wang, Jacob H. Seidman, and Paris Perdikaris. Random weight factorization improves the training of continuous neural representations. arXiv preprint arXiv:2210.01274, 2022. URL https://arxiv.org/abs/2210.01274

[25] Elham Kiyani, Khemraj Shukla, Jorge F. Urbán, Jérôme Darbon, and George Em Karniadakis. Optimizing the optimizer for physics-informed neural networks and Kolmogorov–Arnold networks. Computer Methods in Applied Mechanics and Engineering, 446:118308, 2025. doi: 10.1016/j.cma.2025.118308

[26] Isaac E. Lagaris, Aristidis Likas, and Dimitrios I. Fotiadis. Artificial neural networks for solving ordinary and partial differential equations. IEEE Transactions on Neural Networks, 9(5):987–1000, 1998. doi: 10.1109/72.712178

[27] Jens Berg and Kaj Nyström. A unified deep artificial neural network approach to partial differential equations in complex geometries. Neurocomputing, 317:28–41, 2018. doi: 10.1016/j.neucom.2018.06.056

[28] Lu Lu, Xuhui Meng, Zhiping Mao, and George Em Karniadakis. DeepXDE: A deep learning library for solving differential equations. SIAM Review, 63(1):208–228, 2021. doi: 10.1137/19M1274067

[29] Chenxi Wu, Min Zhu, Qinyang Tan, Yadhu Kartha, and Lu Lu. A comprehensive study of non-adaptive and residual-based adaptive sampling for physics-informed neural networks. Computer Methods in Applied Mechanics and Engineering, 403:115671, 2023. doi: 10.1016/j.cma.2022.115671

[30] Arka Daw, Jie Bu, Sifan Wang, Paris Perdikaris, and Anuj Karpatne. Mitigating propagation failures in physics-informed neural networks using retain-resample-release (R3) sampling. In International Conference on Machine Learning, pages 7264–7302. PMLR, 2023. URL https://proceedings.mlr.press/v202/daw23a.html

[31] Zhiping Gao, Liang Yan, and Tao Zhou. Failure-informed adaptive sampling for PINNs. SIAM Journal on Scientific Computing, 45(4):A1971–A1994, 2023. doi: 10.1137/22M1527763

[32] Yifan Du and Tamer A. Zaki. Evolutional deep neural network. Physical Review E, 104(4):045303, 2021. doi: 10.1103/PhysRevE.104.045303

[33] Albert Gu and Tri Dao. Mamba: Linear-time sequence modeling with selective state spaces. In First Conference on Language Modeling, 2024. URL https://openreview.net/forum?id=tEYskw1VY2

[34] Albert Gu, Karan Goel, and Christopher Ré. Efficiently modeling long sequences with structured state spaces. In The Tenth International Conference on Learning Representations, 2022. URL https://openreview.net/forum?id=uYLFoz1vlAC

[35] Jimmy T. H. Smith, Andrew Warrington, and Scott W. Linderman. Simplified state space layers for sequence modeling. In The Eleventh International Conference on Learning Representations, 2023. URL https://openreview.net/forum?id=Ai8Hw3AXqks

[37] Antonio Orvieto, Samuel L. Smith, Albert Gu, Anushan Fernando, Caglar Gulcehre, Razvan Pascanu, and Soham De. Resurrecting recurrent neural networks for long sequences. In International Conference on Machine Learning, pages 26670–26698. PMLR, 2023. URL https://proceedings.mlr.press/v202/orvieto23a.html

[38] Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, and Anima Anandkumar. Fourier neural operator for parametric partial differential equations. In International Conference on Learning Representations, 2021. URL https://openreview.net/forum?id=c8P9NQVtmnO

[39] Lu Lu, Pengzhan Jin, Guofei Pang, Zhongqiang Zhang, and George Em Karniadakis. Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators. Nature Machine Intelligence, 3(3):218–229, 2021. doi: 10.1038/s42256-021-00302-5

[40] Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, Burigede Liu, Kaushik Bhattacharya, Andrew Stuart, and Anima Anandkumar. Neural operator: Graph kernel network for partial differential equations. arXiv preprint arXiv:2003.03485, 2020. URL https://arxiv.org/abs/2003.03485

[41] Sifan Wang, Hanwen Wang, and Paris Perdikaris. Learning the solution operator of parametric partial differential equations with physics-informed DeepONets. Science Advances, 7(40):eabi8605, 2021. doi: 10.1126/sciadv.abi8605

[42] Vladimir S. Fanaskov and Ivan V. Oseledets. Spectral neural operators. Doklady Mathematics, 108(Suppl 2):S226–S232, 2023. doi: 10.1134/S1064562423701107

[43] Ameya D. Jagtap, Ehsan Kharazmi, and George Em Karniadakis. Conservative physics-informed neural networks on discrete domains for conservation laws: Applications to forward and inverse problems. Computer Methods in Applied Mechanics and Engineering, 365:113028, 2020. doi: 10.1016/j.cma.2020.113028

[44] Ameya D. Jagtap and George Em Karniadakis. Extended physics-informed neural networks (XPINNs): A generalized space-time domain decomposition based deep learning framework for nonlinear partial differential equations. Communications in Computational Physics, 28(5):2002–2041, 2020. doi: 10.4208/cicp.OA-2020-0164

[45] Sifan Wang, Hanwen Wang, and Paris Perdikaris. On the eigenvector bias of Fourier feature networks: From regression to solving multi-scale PDEs with physics-informed neural networks. Computer Methods in Applied Mechanics and Engineering, 384:113938, 2021. doi: 10.1016/j.cma.2021.113938

[46] Patrick Kidger, James Morrill, James Foster, and Terry Lyons. Neural controlled differential equations for irregular time series. Advances in Neural Information Processing Systems, 33:6696–6707, 2020

[47] James Morrill, Cristopher Salvi, Patrick Kidger, and James Foster. Neural rough differential equations for long time series. In International Conference on Machine Learning, pages 7829–7838. PMLR, 2021

[48] Benjamin Walker, Andrew D. McLeod, Tiexin Qin, Yichuan Cheng, Haoliang Li, and Terry Lyons. Log neural controlled differential equations: The Lie brackets make a difference. arXiv preprint arXiv:2402.18512, 2024

Supplementary Material

Contents
§A Extended Related Work
§B OSSM-PINN Pseudocode
§C Experiment Overview and Benchmark Suite
§...

... traced failures to imbalanced gradient flow between loss terms. Wang et al. [8] provided a neural tangent kernel analysis revealing spectral bias toward low frequencies, and Rathore et al. [10] showed that PINN loss surfaces contain narrow valleys with ill-conditioned curvature. These analyses motivate two complementary directions: improving the training procedure (loss weighting, optimizers, sampling) and improving the architecture.

A.2 Training Improvements: Weighting, Optimization, and Sampling

Adaptive loss weighting [7] adjusts the balance ...

Using the Hermite basis reduces QHO rMAE from $9.5\times10^{-3}$ to $1.9\times10^{-4}$, a 50× improvement at no architectural cost (Figure 29).

I.0.2 Nonlinear Schrödinger Equation

We consider the cubic NLS $i\psi_t + \tfrac{1}{2}\psi_{xx} + |\psi|^2\psi = 0$ on $(x, t) \in [-5, 5] \times [0, \pi/2]$ with periodic boundary conditions ($\psi$ and $\psi_x$ matched at $x = \pm 5$) and the soliton-like initial condition $\psi(x, 0) = 2\,\mathrm{sech}(x)$,...