pith. sign in

arxiv: 2606.18175 · v1 · pith:2HB4APD4new · submitted 2026-06-16 · 🧮 math.NA · cs.LG· cs.NA· physics.comp-ph

A Convex Quasilinearization Method for Solving Nonlinear PDEs with Physics-Informed Neural Networks

Pith reviewed 2026-06-26 23:42 UTC · model grok-4.3

classification 🧮 math.NA cs.LGcs.NAphysics.comp-ph
keywords quasilinearizationphysics-informed neural networksnonlinear PDEsleast-squares collocationNewton-Kantorovich convergencerandom feature modelsconvex optimization
0
0 comments X

The pith

Quasilinearization converts nonlinear PDEs into sequences of linear least-squares problems solved directly on linear-in-parameters trial spaces.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper introduces the LiL-Q method for solving nonlinear PDEs. It uses Bellman-Kalaba quasilinearization to break the nonlinear problem into a sequence of linear subproblems. Each linear subproblem is discretized by collocation on a trial space where the parameters appear linearly, such as random-feature networks or polynomial bases, and solved exactly with a QR factorization. This replaces the nonconvex gradient descent of standard PINNs with convex linear algebra. A convergence analysis shows local convergence under a smallness condition on the initial residual, with the final accuracy determined by how well the trial space approximates the true solution rather than by solver tolerances. The method is tested on seven benchmark problems including Burgers equation, Navier-Stokes, and Darcy flow, typically converging in a handful of outer iterations.

Core claim

The central discovery is that Bellman-Kalaba quasilinearization combined with Linear-in-Learnables trial spaces allows the forward solution of nonlinear PDEs through a short sequence of convex linear least-squares solves whose accuracy is limited solely by the best-approximation residual of the chosen trial space.

What carries the argument

Linear-in-Learnables (LiL) trial spaces, which are representations whose trainable parameters enter linearly into the model, solved by direct QR factorization after quasilinearization reduces the nonlinear PDE to linear subproblems.

If this is right

  • When the exact solution lies exactly in the trial space, the method recovers it to machine precision in a single solve.
  • The outer iteration converges in single-digit steps for most benchmarks, independent of the number of parameters.
  • On the incompressible Navier-Stokes equations the approach matches or exceeds published PINN results while using up to two orders of magnitude fewer trainable parameters.
  • The limiting accuracy is set by the approximation power of the trial space, not by the optimization procedure.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The approach may allow stable solutions for stiff nonlinear problems where gradient-based methods struggle with local minima.
  • Extending the LiL spaces to include adaptive or hierarchical bases could further improve efficiency for high-dimensional problems.
  • Since each step is a convex solve, the method could be combined with uncertainty quantification techniques that rely on linear algebra.

Load-bearing premise

The quasilinearization iteration is guaranteed to converge only when the initial guess satisfies an explicit smallness condition on the residual norm.

What would settle it

Demonstrating failure to converge for an initial guess that violates the smallness condition in the Newton-Kantorovich theorem, even when the solution is well-approximated by the trial space, would falsify the local convergence guarantee.

Figures

Figures reproduced from arXiv: 2606.18175 by Abdul-Akeem Olawoyin, Gbenga T. Awojinrin, Rami M. Younis.

Figure 1
Figure 1. Figure 1: Bratu equation: simulated solution fields using P = 25 (top row) and P = 225 (bottom row) for NiL-N, NiL-Q, LiL-N and LiL-Q from left to right. 4.2. 2D Bratu The Bratu equation, introduced as the worked example in Section 3.4, is a nonlinear elliptic PDE that serves as a standard steady model problem. It admits two parameter-dependent solution branches that coalesce for the critical coefficient value λc ≈ … view at source ↗
Figure 2
Figure 2. Figure 2: Bratu equation: convergence histories for various methods using networks with P ∈ {25, 100, 225} parameters. heuristic interventions). LiL-Q converges in 3 iterations at both enriched basis sizes, while LiL-N converges but with a rapidly growing iteration count (1,597 and 7,748 respectively). This suggests that the physical nonlinearity preserves optimization difficulty, even when the architectural nonline… view at source ↗
Figure 3
Figure 3. Figure 3: Bratu equation: convergence histories organized by method [PITH_FULL_IMAGE:figures/full_fig_p020_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Bratu equation: comparison of linearized and nonlinear residual norms for LiL-Q at P ∈ {25, 100, 225}. The asymptotic agreement between the two quantities corroborates the convergence analysis of Section 3.5. 20 [PITH_FULL_IMAGE:figures/full_fig_p020_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Burgers equation: convergence histories organized by method. Each panel shows all five basis sizes for a single formulation [PITH_FULL_IMAGE:figures/full_fig_p022_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Burgers equation: comparison of linearized and nonlinear residual norms for LiL-Q at P ∈ {25, 100, 225, 400, 625}. 22 [PITH_FULL_IMAGE:figures/full_fig_p022_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Burgers equation: predicted solution fields at P = 625 [PITH_FULL_IMAGE:figures/full_fig_p023_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Convergence histories for eight LiL configurations on the viscous Burgers equation (P = 625) [PITH_FULL_IMAGE:figures/full_fig_p024_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Buckley-Leverett (viscous flow, Ng = 0): convergence histories organized by method. Each panel shows all four basis sizes for a single formulation [PITH_FULL_IMAGE:figures/full_fig_p025_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Buckley-Leverett (viscous flow, Ng = 0): comparison of linearized and nonlinear residual norms for LiL-Q at P ∈ {64, 256, 576, 1,024}. 25 [PITH_FULL_IMAGE:figures/full_fig_p025_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Buckley-Leverett (gravity-influenced flow, Ng = −5): convergence histories organized by method. The NiL-Q panel exhibits order-of-magnitude loss spikes not observed in any other experiment [PITH_FULL_IMAGE:figures/full_fig_p027_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Buckley-Leverett (gravity-influenced flow, Ng = −5): comparison of linearized and nonlinear residual norms for LiL-Q at P ∈ {64, 256, 576, 1,024}. 27 [PITH_FULL_IMAGE:figures/full_fig_p027_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: Buckley-Leverett solutions at P = 1,024: NiL-N (left) and LiL-Q (right) for viscous displacement (top) and gravity￾influenced displacement (bottom). 28 [PITH_FULL_IMAGE:figures/full_fig_p028_13.png] view at source ↗
Figure 14
Figure 14. Figure 14: Linear elasticity at P = 50: exact displacement fields (left), LiL predictions (center), and pointwise absolute errors (right). Errors are at O(10−15) throughout the domain, consistent with machine-precision recovery. 31 [PITH_FULL_IMAGE:figures/full_fig_p031_14.png] view at source ↗
Figure 15
Figure 15. Figure 15: Kovasznay flow at P = 1,875: exact fields (left), LiL-Q predictions (center), and pointwise absolute errors (right) for u, v, and p. 33 [PITH_FULL_IMAGE:figures/full_fig_p033_15.png] view at source ↗
Figure 16
Figure 16. Figure 16: Beltrami flow at t = 1: exact fields (left), LiL-Q predictions (middle), and pointwise absolute errors (right) for u, v, w, and p. 36 [PITH_FULL_IMAGE:figures/full_fig_p036_16.png] view at source ↗
Figure 17
Figure 17. Figure 17: Permeability fields (log10 K in mD) for the four test cases, with heterogeneity contrast increasing from left to right. comparison is indicative rather than definitive and is conservative for LiL-Q, which runs on CPU. Across all four fields, the LiL boundary conditions are satisfied to machine precision by construction, whereas NiL boundary residuals are small but non-zero. The FVM solves each problem in … view at source ↗
Figure 18
Figure 18. Figure 18: Pressure field solutions for (a) FVM (b) NiL and (c) LiL across the four permeability realizations. 40 [PITH_FULL_IMAGE:figures/full_fig_p040_18.png] view at source ↗
read the original abstract

We present a numerical method for the forward solution of nonlinear partial differential equations (PDEs) in which Bellman-Kalaba quasilinearization reduces the nonlinear problem to a sequence of linear subproblems, each discretized by collocation onto a trial space that is linear in its parameters and solved by a single direct linear least-squares QR factorization. The trial space, which we term Linear-in-Learnables (LiL), comprises representations whose trainable parameters enter linearly, including random-feature extreme learning machines, spectral polynomial bases, and trigonometric expansions, each implemented as a physics-informed neural network. The method thus replaces the nonconvex gradient-based training that limits standard PINNs with a convex per-step solve. We establish local Newton-Kantorovich convergence of the outer iteration to a residual-limited neighborhood under an explicit smallness condition, with the limiting accuracy governed by the best-approximation residual of the trial space rather than by an optimization tolerance. The method, denoted LiL-Q, is assessed on seven benchmarks spanning scalar nonlinear PDEs (Bratu, viscous Burgers, Buckley-Leverett), coupled systems (plane-strain elasticity and the incompressible Navier-Stokes equations in two and three spatial dimensions), and steady-state Darcy flow with heterogeneous permeability. Across these problems, LiL-Q converges in single-digit outer iterations in most cases, even at the coarsest basis sizes and independent of the parameter count. When the exact solution lies in the span of the trial space, the method recovers it to machine precision in a single solve. On the Navier-Stokes benchmarks, it matches or exceeds published PINN solvers with up to two orders of magnitude fewer trainable parameters, without gradient-based optimization.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces the LiL-Q method, which applies Bellman-Kalaba quasilinearization to convert nonlinear PDEs into a sequence of linear subproblems. Each subproblem is discretized via collocation onto a Linear-in-Learnables (LiL) trial space (random-feature ELMs, spectral polynomials, trigonometric bases) and solved by a single direct least-squares QR factorization. The paper establishes local Newton-Kantorovich convergence of the outer iteration to a residual-limited neighborhood under an explicit smallness condition, with limiting accuracy governed by the trial-space best-approximation error. Numerical results on seven benchmarks (Bratu, viscous Burgers, Buckley-Leverett, plane-strain elasticity, 2D/3D incompressible Navier-Stokes, heterogeneous Darcy flow) report single-digit outer iterations, machine-precision recovery when the exact solution lies in the trial space, and competitive accuracy versus published PINNs using up to two orders of magnitude fewer trainable parameters without gradient-based optimization.

Significance. If the local convergence result and the smallness condition hold for the reported initial guesses, the work offers a convex, optimization-free alternative to standard PINN training that replaces nonconvex gradient descent with direct linear solves. The explicit separation of approximation error from optimization tolerance and the empirical observation of rapid convergence independent of parameter count are potentially valuable contributions for forward nonlinear PDE problems.

major comments (2)
  1. [Abstract, convergence paragraph] Abstract, convergence paragraph: The local Newton-Kantorovich convergence to a residual-limited neighborhood is asserted under an explicit smallness condition on the initial residual (or guess), yet neither the precise mathematical statement of this condition nor any verification that it is satisfied by the initial guesses and residuals used in the seven benchmarks is supplied. This verification is load-bearing for the claim that the observed single-digit iteration counts follow from the theory rather than problem-specific initialization or basis choice.
  2. [§5, Table 1] §5 (numerical results) and Table 1: The statement that accuracy is governed by the best-approximation residual of the trial space rather than optimization tolerance is central, but the manuscript does not report the computed best-approximation residuals (or their decay with basis size) for the chosen LiL spaces on the benchmarks, preventing direct confirmation that the observed errors are indeed limited by approximation rather than other factors.
minor comments (2)
  1. [§2] The notation for the LiL trial space and the collocation operator could be made more uniform across sections to improve readability.
  2. [§5.3] Figure captions for the Navier-Stokes benchmarks should explicitly state the achieved residual norms alongside parameter counts for direct comparison with the cited PINN references.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the thorough review and valuable comments, which help clarify the presentation of the theoretical guarantees and their relation to the numerical results. We address each major comment below and will incorporate the suggested additions in the revised manuscript.

read point-by-point responses
  1. Referee: [Abstract, convergence paragraph] Abstract, convergence paragraph: The local Newton-Kantorovich convergence to a residual-limited neighborhood is asserted under an explicit smallness condition on the initial residual (or guess), yet neither the precise mathematical statement of this condition nor any verification that it is satisfied by the initial guesses and residuals used in the seven benchmarks is supplied. This verification is load-bearing for the claim that the observed single-digit iteration counts follow from the theory rather than problem-specific initialization or basis choice.

    Authors: The precise mathematical statement of the smallness condition is provided in the Newton-Kantorovich theorem of Section 3. We agree, however, that restating the condition explicitly in the abstract and verifying its satisfaction for the initial guesses employed in the benchmarks would strengthen the link between theory and the reported single-digit iteration counts. In the revision we will (i) insert the exact smallness condition into the abstract convergence paragraph and (ii) add a short verification subsection (or supplementary table) that computes the initial residuals for each of the seven benchmarks and confirms that the condition holds, thereby showing that the observed convergence is consistent with the theorem rather than arising solely from problem-specific choices. revision: yes

  2. Referee: [§5, Table 1] §5 (numerical results) and Table 1: The statement that accuracy is governed by the best-approximation residual of the trial space rather than optimization tolerance is central, but the manuscript does not report the computed best-approximation residuals (or their decay with basis size) for the chosen LiL spaces on the benchmarks, preventing direct confirmation that the observed errors are indeed limited by approximation rather than other factors.

    Authors: We concur that explicit reporting of the best-approximation residuals is necessary to substantiate the claim that the observed errors are governed by the trial-space approximation quality. In the revised manuscript we will compute and tabulate these residuals for the LiL spaces (random-feature ELMs, spectral polynomials, trigonometric bases) used on each benchmark, including their decay with increasing basis size (number of features or polynomial degree). These values will be compared directly with the achieved solution errors in Table 1 and the associated figures, confirming that the errors track the best-approximation bounds rather than any optimization tolerance. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation relies on standard quasilinearization and Newton-Kantorovich theory

full rationale

The paper applies the established Bellman-Kalaba quasilinearization to convert nonlinear PDEs into sequences of linear subproblems, then discretizes each via collocation on a Linear-in-Learnables (LiL) trial space and solves with direct QR least-squares. The local convergence claim invokes the standard Newton-Kantorovich theorem under an explicit smallness condition on the initial residual; this is an external mathematical result, not derived from or equivalent to the paper's own fitted parameters, trial-space choices, or benchmark outputs. No self-definitional loops, fitted inputs renamed as predictions, or load-bearing self-citations appear in the derivation chain. The seven benchmarks serve as independent numerical validation rather than inputs that force the claimed results by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The central claim rests on quasilinearization reducing the nonlinear PDE to linear subproblems solvable by least-squares and on the LiL property enabling convex direct solves; the smallness condition is a key unverified assumption for the convergence result.

axioms (1)
  • domain assumption Local Newton-Kantorovich convergence holds under an explicit smallness condition on the initial residual or guess
    Stated in the abstract as the basis for the convergence guarantee to a residual-limited neighborhood.
invented entities (1)
  • Linear-in-Learnables (LiL) trial space no independent evidence
    purpose: Representations (random-feature ELMs, spectral polynomials, trigonometric expansions) whose parameters enter linearly to permit convex least-squares solves
    New term introduced in the abstract to group the trial spaces used for the convex per-step solves.

pith-pipeline@v0.9.1-grok · 5858 in / 1524 out tokens · 39626 ms · 2026-06-26T23:42:34.771051+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

54 extracted references · 38 canonical work pages

  1. [1]

    Petroleum Reservoir Simulation

    Aziz, K., Settari, A., 1979. Petroleum Reservoir Simulation. Applied Science Publishers, London. doi:10.2118/9781613999646

  2. [2]

    Quasilinearization and Nonlinear Boundary-Value Problems

    Bellman, R., Kalaba, R., 1965. Quasilinearization and Nonlinear Boundary-Value Problems. American Elsevier, New York. doi:10.2307/3612757

  3. [3]

    The calculation of linear least squares problems

    Bjorck, A., 2004. The calculation of linear least squares problems. Acta Numerica 13, 1–53. doi:10. 1017/S0962492904000169

  4. [4]

    Chebyshev and Fourier Spectral Methods

    Boyd, J.P., 2001. Chebyshev and Fourier Spectral Methods. 2nd ed., Dover Publications, Mineola, NY

  5. [5]

    Mechanism of fluid displacement in sands

    Buckley, S.E., Leverett, M.C., 1942. Mechanism of fluid displacement in sands. Transactions of the AIME 146, 107–116. doi:10.2118/942107-G

  6. [6]

    Extreme learning machine collocation for the numerical solution of elliptic pdes with sharp gradients

    Calabrò, F., Fabiani, G., Siettos, C., 2021. Extreme learning machine collocation for the numerical solution of elliptic pdes with sharp gradients. Computer Methods in Applied Mechanics and Engineering 387, 114188. doi:10.1016/j.cma.2021.114188

  7. [7]

    Spectral Methods: Funda- mentals in Single Domains

    Canuto, C.G., Hussaini, M.Y., Quarteroni, A., Zang, T.A., 2006. Spectral Methods: Funda- mentals in Single Domains. Scientific Computation, Springer, Berlin, Heidelberg. doi:10.1007/ 978-3-540-30726-6

  8. [8]

    Tenth spe comparative solution project: A comparison of upscaling techniques

    Christie, M.A., Blunt, M.J., 2001. Tenth spe comparative solution project: A comparison of upscaling techniques. SPE Reservoir Evaluation and Engineering 4, 308–317. doi:10.2118/72469-PA

  9. [9]

    Scientific machine learning through physics-informed neural networks: Where we are and what’s next

    Cuomo, S., Cola, V.S.D., Giampaolo, F., Rozza, G., Raissi, M., Piccialli, F., 2022. Scientific machine learning through physics-informed neural networks: Where we are and what’s next. Journal of Scientific Computing 92, 88. doi:10.1007/s10915-022-01939-z

  10. [10]

    Inexact newton methods

    Dembo, R.S., Eisenstat, S.C., Steihaug, T., 1982. Inexact newton methods. SIAM Journal on Numerical Analysis 19, 400–408. doi:10.1137/0719025

  11. [11]

    Newton Methods for Nonlinear Problems: Affine Invariance and Adaptive Algo- rithms

    Deuflhard, P., 2011. Newton Methods for Nonlinear Problems: Affine Invariance and Adaptive Algo- rithms. 2nd ed., Springer, Berlin. doi:10.1007/978-3-642-23899-4

  12. [12]

    Local extreme learning machines and domain decomposition for solving linear and nonlinear partial differential equations

    Dong, S., Li, Z., 2021. Local extreme learning machines and domain decomposition for solving linear and nonlinear partial differential equations. Computer Methods in Applied Mechanics and Engineering 387, 114129. doi:10.1016/j.cma.2021.114129

  13. [13]

    Physics informed extreme learning machine (PIELM) – a rapid methodforthenumericalsolutionofpartialdifferentialequations

    Dwivedi, V., Srinivasan, B., 2020. Physics informed extreme learning machine (PIELM) – a rapid methodforthenumericalsolutionofpartialdifferentialequations. Neurocomputing391, 96–118. doi:10. 1016/j.neucom.2019.12.099

  14. [14]

    Physics informed deep learning for flow and transport in porous media, in: SPE Reservoir Simulation Conference, Society of Petroleum Engineers

    Fraces, C.G., Tchelepi, H.A., 2021. Physics informed deep learning for flow and transport in porous media, in: SPE Reservoir Simulation Conference, Society of Petroleum Engineers. p. D011S006R002. doi:10.2118/203934-MS. sPE-203934-MS

  15. [15]

    Limitations of physics informed machine learning for nonlinear two- phase transport in porous media

    Fuks, O., Tchelepi, H.A., 2020. Limitations of physics informed machine learning for nonlinear two- phase transport in porous media. Journal of Machine Learning for Modeling and Computing 1, 19–37. doi:10.1615/JMachLearnModelComput.2020033905

  16. [16]

    Deep Learning

    Goodfellow, I., Bengio, Y., Courville, A., 2016. Deep Learning. MIT Press, Cambridge, MA. URL: http://www.deeplearningbook.org. 43

  17. [17]

    Can physics-informed neural networks beat the finite element method? IMA Journal of Applied Mathematics 89, 143–174

    Grossmann, T.G., Komorowska, U.J., Latz, J., Schönlieb, C.B., 2024. Can physics-informed neural networks beat the finite element method? IMA Journal of Applied Mathematics 89, 143–174. doi:10. 1093/imamat/hxae011

  18. [18]

    Physics-informed neural networks with domain decomposition for the incompressible navier-stokes equations

    Gu, L., Qin, S., Xu, L., Chen, R., 2024. Physics-informed neural networks with domain decomposition for the incompressible navier-stokes equations. Physics of Fluids 36, 021914. doi:10.1063/5.0188830

  19. [19]

    A physics-informed deep learning framework for inversion and surrogate modeling in solid mechanics

    Haghighat, E., Raissi, M., Moure, A., Gomez, H., Juanes, R., 2021. A physics-informed deep learning framework for inversion and surrogate modeling in solid mechanics. Computer Methods in Applied Mechanics and Engineering 379, 113741. doi:10.1016/j.cma.2021.113741

  20. [20]

    Newton informed neural operator for solving nonlinear partial differential equations, in: Advances in Neural Information Processing Systems (NeurIPS), pp

    Hao, W., Liu, X., Yang, Y., 2024. Newton informed neural operator for solving nonlinear partial differential equations, in: Advances in Neural Information Processing Systems (NeurIPS), pp. 120832– 120860. URL:https://pmc.ncbi.nlm.nih.gov/articles/PMC11973962/

  21. [21]

    Extreme learning machine: Theory and applications

    Huang, G.B., Zhu, Q.Y., Siew, C.K., 2006. Extreme learning machine: Theory and applications. Neurocomputing 70, 489–501. doi:10.1016/j.neucom.2005.12.126

  22. [22]

    HomPINNs: Homotopy physics-informed neural networks for learning multiple solutions of nonlinear elliptic differential equations

    Huang, Y., Hao, W., Lin, G., 2022. HomPINNs: Homotopy physics-informed neural networks for learning multiple solutions of nonlinear elliptic differential equations. Computers and Mathematics with Applications 121, 62–73. doi:10.1016/j.camwa.2022.07.002

  23. [23]

    Conservative physics-informed neural networks on discrete domains for conservation laws: Applications to forward and inverse problems

    Jagtap, A.D., Kharazmi, E., Karniadakis, G.E., 2020. Conservative physics-informed neural networks on discrete domains for conservation laws: Applications to forward and inverse problems. Computer Methods in Applied Mechanics and Engineering 365, 113028. doi:10.1016/j.cma.2020.113028

  24. [24]

    NSFnets (navier-stokes flow nets): Physics-informed neural networks for the incompressible navier-stokes equations

    Jin, X., Cai, S., Li, H., Karniadakis, G.E., 2021. NSFnets (navier-stokes flow nets): Physics-informed neural networks for the incompressible navier-stokes equations. Journal of Computational Physics 426, 109951. doi:10.1016/j.jcp.2020.109951

  25. [25]

    Functional Analysis

    Kantorovich, L.V., Akilov, G.P., 1982. Functional Analysis. Elsevier. doi:10.1016/C2013-0-03044-7

  26. [26]

    Nature Reviews Physics3(6), 422–440 (2021) https://doi

    Karniadakis, G.E., Kevrekidis, I.G., Lu, L., Perdikaris, P., Wang, S., Yang, L., 2021. Physics-informed machine learning. Nature Reviews Physics 3, 422–440. doi:10.1038/s42254-021-00314-5

  27. [27]

    hp-VPINNs: Variational physics-informed neural networks with domain decomposition

    Kharazmi, E., Zhang, Z., Karniadakis, G.E., 2021. hp-VPINNs: Variational physics-informed neural networks with domain decomposition. Computer Methods in Applied Mechanics and Engineering 374, 113547. doi:10.1016/j.cma.2020.113547

  28. [28]

    Characterizing possible failure modes in physics-informed neural networks, in: Advances in Neural Information Processing Systems, pp

    Krishnapriyan, A.S., Gholami, A., Zhe, S., Kirby, R.M., Mahoney, M.W., 2021. Characterizing possible failure modes in physics-informed neural networks, in: Advances in Neural Information Processing Systems, pp. 26548–26560. URL:https://arxiv.org/abs/2109.01050

  29. [29]

    Anti-derivatives approximator for enhancing physics-informed neural networks

    Lee, J., 2024. Anti-derivatives approximator for enhancing physics-informed neural networks. Computer Methods in Applied Mechanics and Engineering 426, 117000. doi:10.1016/j.cma.2024.117000

  30. [30]

    Finite Difference Methods for Ordinary and Partial Differential Equations: Steady- State and Time-Dependent Problems

    LeVeque, R.J., 2007. Finite Difference Methods for Ordinary and Partial Differential Equations: Steady- State and Time-Dependent Problems. SIAM, Philadelphia. doi:10.1137/1.9780898717839

  31. [31]

    Nonlinear analysis of multiphase transport in porous media in the presence of viscous, buoyancy, and capillary forces

    Li, B., Tchelepi, H.A., 2015. Nonlinear analysis of multiphase transport in porous media in the presence of viscous, buoyancy, and capillary forces. Journal of Computational Physics 297, 104–131. doi:10. 1016/j.jcp.2015.04.057

  32. [32]

    Locally linearized physics-informed neural networks for riemann problems of hyperbolic conservation laws

    Liu, J., Zheng, S., Song, X., Xu, D., 2024. Locally linearized physics-informed neural networks for riemann problems of hyperbolic conservation laws. Physics of Fluids 36, 116135. doi:10.1063/5. 0238865. 44

  33. [33]

    DeepXDE: A deep learning library for solving differential equations

    Lu, L., Meng, X., Mao, Z., Karniadakis, G.E., 2021. DeepXDE: A deep learning library for solving differential equations. SIAM Review 63, 208–228. doi:10.1137/19M1274067

  34. [34]

    Quasilinearization approach to nonlinear problems in physics with application to nonlinear odes

    Mandelzweig, V.B., Tabakin, F., 2001. Quasilinearization approach to nonlinear problems in physics with application to nonlinear odes. Computer Physics Communications 141, 268–281. doi:10.1016/ S0010-4655(01)00415-5

  35. [36]

    A novel sequential method to train physics informed neural networks for allen cahn and cahn hilliard equations

    Mattey, R., Ghosh, S., 2022. A novel sequential method to train physics informed neural networks for allen cahn and cahn hilliard equations. Computer Methods in Applied Mechanics and Engineering 390, 114474. doi:10.1016/j.cma.2021.114474

  36. [37]

    Self-adaptive physics-informed neural networks using a soft attention mechanism

    McClenny, L.D., Braga-Neto, U.M., 2023. Self-adaptive physics-informed neural networks using a soft attention mechanism. Journal of Computational Physics 474, 111722. doi:10.1016/j.jcp.2022. 111722

  37. [38]

    Weak baselines and reporting biases lead to overoptimism in machine learning for fluid mechanics

    McGreivy, N., Hakim, A., 2024. Weak baselines and reporting biases lead to overoptimism in machine learning for fluid mechanics. Nature Machine Intelligence 6, 1177–1189. doi:10.1038/ s42256-024-00897-5

  38. [39]

    Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations.J

    Raissi, M., Perdikaris, P., Karniadakis, G.E., 2019. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. Journal of Computational Physics 378, 686–707. doi:10.1016/j.jcp.2018.10.045

  39. [40]

    Challenges in training pinns: A loss land- scape perspective, in: Proceedings of the 41st International Conference on Machine Learning (ICML), pp

    Rathore, P., Lei, W., Frangella, Z., Lu, L., Udell, M., 2024. Challenges in training pinns: A loss land- scape perspective, in: Proceedings of the 41st International Conference on Machine Learning (ICML), pp. 40828–40848. doi:10.5555/3692070.3693785

  40. [41]

    Exact enforcement of temporal continuity in sequential physics- informed neural networks

    Roy, P., Castonguay, S.T., 2024. Exact enforcement of temporal continuity in sequential physics- informed neural networks. Computer Methods in Applied Mechanics and Engineering 430, 117197. doi:10.1016/j.cma.2024.117197

  41. [42]

    Numerical analysis of physics-informed neural networks and re- lated models in physics-informed machine learning

    Ryck, T.D., Mishra, S., 2024. Numerical analysis of physics-informed neural networks and re- lated models in physics-informed machine learning. Acta Numerica 33, 633–713. doi:10.1017/ S0962492923000089

  42. [43]

    Sparse linear least-squares problems

    Scott, J., Tuma, M., 2025. Sparse linear least-squares problems. Acta Numerica 34, 891–1010. doi:10. 1017/S0962492924000059

  43. [44]

    Localized linear systems for fully implicit simulation of multiphase multicomponent flow in porous media

    Sheth, S., Moncorgé, A., Younis, R., 2020. Localized linear systems for fully implicit simulation of multiphase multicomponent flow in porous media. Computational Geosciences 24, 743–759. doi:10. 1007/s10596-019-09840-9

  44. [45]

    Localized linear systems in sequential implicit simulation of two-phase flow and transport

    Sheth, S.M., Younis, R.M., 2017. Localized linear systems in sequential implicit simulation of two-phase flow and transport. SPE Journal 22, 1542–1569. doi:10.2118/173320-PA

  45. [46]

    On the convergence of physics informed neural networks for linear second-order elliptic and parabolic type pdes

    Shin, Y., Darbon, J., Karniadakis, G.E., 2020. On the convergence of physics informed neural networks for linear second-order elliptic and parabolic type pdes. Communications in Computational Physics 28, 2042–2074. doi:10.4208/cicp.OA-2020-0193

  46. [47]

    An Analysis of the Finite Element Method

    Strang, G., Fix, G.J., 1973. An Analysis of the Finite Element Method. Prentice-Hall, Englewood Cliffs, NJ. doi:10.1137/1.9780980232707. 45

  47. [48]

    Fourier features let networks learn high frequency functions in low dimensional domains, in: Advances in Neural Information Processing Systems, pp

    Tancik, M., Srinivasan, P.P., Mildenhall, B., Fridovich-Keil, S., Raghavan, N., Singhal, U., Ramamoor- thi, R., Barron, J.T., Ng, R., 2020. Fourier features let networks learn high frequency functions in low dimensional domains, in: Advances in Neural Information Processing Systems, pp. 7537–7547. doi:10.5555/3495724.3496356

  48. [49]

    Spectral Methods in MATLAB

    Trefethen, L.N., 2000. Spectral Methods in MATLAB. Society for Industrial and Applied Mathematics. doi:10.1137/1.9780898719598

  49. [50]

    Ziegler, M

    Wang, S., Sankaran, S., Perdikaris, P., 2024. Respecting causality for training physics-informed neural networks. Computer Methods in Applied Mechanics and Engineering 421, 116813. doi:10.1016/j.cma. 2024.116813

  50. [51]

    Understanding and mitigating gradient flow pathologies in physics-informed neural networks

    Wang, S., Teng, Y., Perdikaris, P., 2021a. Understanding and mitigating gradient flow pathologies in physics-informed neural networks. SIAM Journal on Scientific Computing 43, A3055–A3081. doi:10. 1137/20M1318043

  51. [52]

    On the eigenvector bias of fourier feature networks: From regression to solving multi-scale pdes with physics-informed neural networks

    Wang, S., Wang, H., Perdikaris, P., 2021b. On the eigenvector bias of fourier feature networks: From regression to solving multi-scale pdes with physics-informed neural networks. Computer Methods in Applied Mechanics and Engineering 384, 113938. doi:10.1016/j.cma.2021.113938

  52. [53]

    LD McClenny and UM Braga-Neto

    Wang, S., Yu, X., Perdikaris, P., 2022. When and why pinns fail to train: A neural tangent kernel perspective. Journal of Computational Physics 449, 110768. doi:10.1016/j.jcp.2021.110768

  53. [54]

    Adaptively localized continuation-newton method—nonlinear solvers that converge all the time

    Younis, R.M., Tchelepi, H.A., Aziz, K., 2009. Adaptively localized continuation-newton method—nonlinear solvers that converge all the time. SPE Journal 15, 526–544. doi:10.2118/ 119147-PA

  54. [55]

    Gradient-enhanced physics-informed neural networks for forward and inverse pde problems

    Yu, J., Lu, L., Meng, X., Karniadakis, G.E., 2022. Gradient-enhanced physics-informed neural networks for forward and inverse pde problems. Computer Methods in Applied Mechanics and Engineering 393, 114823. doi:10.1016/j.cma.2022.114823. 46