A Convex Quasilinearization Method for Solving Nonlinear PDEs with Physics-Informed Neural Networks
Pith reviewed 2026-06-26 23:42 UTC · model grok-4.3
The pith
Quasilinearization converts nonlinear PDEs into sequences of linear least-squares problems solved directly on linear-in-parameters trial spaces.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central discovery is that Bellman-Kalaba quasilinearization combined with Linear-in-Learnables trial spaces allows the forward solution of nonlinear PDEs through a short sequence of convex linear least-squares solves whose accuracy is limited solely by the best-approximation residual of the chosen trial space.
What carries the argument
Linear-in-Learnables (LiL) trial spaces, which are representations whose trainable parameters enter linearly into the model, solved by direct QR factorization after quasilinearization reduces the nonlinear PDE to linear subproblems.
If this is right
- When the exact solution lies exactly in the trial space, the method recovers it to machine precision in a single solve.
- The outer iteration converges in single-digit steps for most benchmarks, independent of the number of parameters.
- On the incompressible Navier-Stokes equations the approach matches or exceeds published PINN results while using up to two orders of magnitude fewer trainable parameters.
- The limiting accuracy is set by the approximation power of the trial space, not by the optimization procedure.
Where Pith is reading between the lines
- The approach may allow stable solutions for stiff nonlinear problems where gradient-based methods struggle with local minima.
- Extending the LiL spaces to include adaptive or hierarchical bases could further improve efficiency for high-dimensional problems.
- Since each step is a convex solve, the method could be combined with uncertainty quantification techniques that rely on linear algebra.
Load-bearing premise
The quasilinearization iteration is guaranteed to converge only when the initial guess satisfies an explicit smallness condition on the residual norm.
What would settle it
Demonstrating failure to converge for an initial guess that violates the smallness condition in the Newton-Kantorovich theorem, even when the solution is well-approximated by the trial space, would falsify the local convergence guarantee.
Figures
read the original abstract
We present a numerical method for the forward solution of nonlinear partial differential equations (PDEs) in which Bellman-Kalaba quasilinearization reduces the nonlinear problem to a sequence of linear subproblems, each discretized by collocation onto a trial space that is linear in its parameters and solved by a single direct linear least-squares QR factorization. The trial space, which we term Linear-in-Learnables (LiL), comprises representations whose trainable parameters enter linearly, including random-feature extreme learning machines, spectral polynomial bases, and trigonometric expansions, each implemented as a physics-informed neural network. The method thus replaces the nonconvex gradient-based training that limits standard PINNs with a convex per-step solve. We establish local Newton-Kantorovich convergence of the outer iteration to a residual-limited neighborhood under an explicit smallness condition, with the limiting accuracy governed by the best-approximation residual of the trial space rather than by an optimization tolerance. The method, denoted LiL-Q, is assessed on seven benchmarks spanning scalar nonlinear PDEs (Bratu, viscous Burgers, Buckley-Leverett), coupled systems (plane-strain elasticity and the incompressible Navier-Stokes equations in two and three spatial dimensions), and steady-state Darcy flow with heterogeneous permeability. Across these problems, LiL-Q converges in single-digit outer iterations in most cases, even at the coarsest basis sizes and independent of the parameter count. When the exact solution lies in the span of the trial space, the method recovers it to machine precision in a single solve. On the Navier-Stokes benchmarks, it matches or exceeds published PINN solvers with up to two orders of magnitude fewer trainable parameters, without gradient-based optimization.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces the LiL-Q method, which applies Bellman-Kalaba quasilinearization to convert nonlinear PDEs into a sequence of linear subproblems. Each subproblem is discretized via collocation onto a Linear-in-Learnables (LiL) trial space (random-feature ELMs, spectral polynomials, trigonometric bases) and solved by a single direct least-squares QR factorization. The paper establishes local Newton-Kantorovich convergence of the outer iteration to a residual-limited neighborhood under an explicit smallness condition, with limiting accuracy governed by the trial-space best-approximation error. Numerical results on seven benchmarks (Bratu, viscous Burgers, Buckley-Leverett, plane-strain elasticity, 2D/3D incompressible Navier-Stokes, heterogeneous Darcy flow) report single-digit outer iterations, machine-precision recovery when the exact solution lies in the trial space, and competitive accuracy versus published PINNs using up to two orders of magnitude fewer trainable parameters without gradient-based optimization.
Significance. If the local convergence result and the smallness condition hold for the reported initial guesses, the work offers a convex, optimization-free alternative to standard PINN training that replaces nonconvex gradient descent with direct linear solves. The explicit separation of approximation error from optimization tolerance and the empirical observation of rapid convergence independent of parameter count are potentially valuable contributions for forward nonlinear PDE problems.
major comments (2)
- [Abstract, convergence paragraph] Abstract, convergence paragraph: The local Newton-Kantorovich convergence to a residual-limited neighborhood is asserted under an explicit smallness condition on the initial residual (or guess), yet neither the precise mathematical statement of this condition nor any verification that it is satisfied by the initial guesses and residuals used in the seven benchmarks is supplied. This verification is load-bearing for the claim that the observed single-digit iteration counts follow from the theory rather than problem-specific initialization or basis choice.
- [§5, Table 1] §5 (numerical results) and Table 1: The statement that accuracy is governed by the best-approximation residual of the trial space rather than optimization tolerance is central, but the manuscript does not report the computed best-approximation residuals (or their decay with basis size) for the chosen LiL spaces on the benchmarks, preventing direct confirmation that the observed errors are indeed limited by approximation rather than other factors.
minor comments (2)
- [§2] The notation for the LiL trial space and the collocation operator could be made more uniform across sections to improve readability.
- [§5.3] Figure captions for the Navier-Stokes benchmarks should explicitly state the achieved residual norms alongside parameter counts for direct comparison with the cited PINN references.
Simulated Author's Rebuttal
We thank the referee for the thorough review and valuable comments, which help clarify the presentation of the theoretical guarantees and their relation to the numerical results. We address each major comment below and will incorporate the suggested additions in the revised manuscript.
read point-by-point responses
-
Referee: [Abstract, convergence paragraph] Abstract, convergence paragraph: The local Newton-Kantorovich convergence to a residual-limited neighborhood is asserted under an explicit smallness condition on the initial residual (or guess), yet neither the precise mathematical statement of this condition nor any verification that it is satisfied by the initial guesses and residuals used in the seven benchmarks is supplied. This verification is load-bearing for the claim that the observed single-digit iteration counts follow from the theory rather than problem-specific initialization or basis choice.
Authors: The precise mathematical statement of the smallness condition is provided in the Newton-Kantorovich theorem of Section 3. We agree, however, that restating the condition explicitly in the abstract and verifying its satisfaction for the initial guesses employed in the benchmarks would strengthen the link between theory and the reported single-digit iteration counts. In the revision we will (i) insert the exact smallness condition into the abstract convergence paragraph and (ii) add a short verification subsection (or supplementary table) that computes the initial residuals for each of the seven benchmarks and confirms that the condition holds, thereby showing that the observed convergence is consistent with the theorem rather than arising solely from problem-specific choices. revision: yes
-
Referee: [§5, Table 1] §5 (numerical results) and Table 1: The statement that accuracy is governed by the best-approximation residual of the trial space rather than optimization tolerance is central, but the manuscript does not report the computed best-approximation residuals (or their decay with basis size) for the chosen LiL spaces on the benchmarks, preventing direct confirmation that the observed errors are indeed limited by approximation rather than other factors.
Authors: We concur that explicit reporting of the best-approximation residuals is necessary to substantiate the claim that the observed errors are governed by the trial-space approximation quality. In the revised manuscript we will compute and tabulate these residuals for the LiL spaces (random-feature ELMs, spectral polynomials, trigonometric bases) used on each benchmark, including their decay with increasing basis size (number of features or polynomial degree). These values will be compared directly with the achieved solution errors in Table 1 and the associated figures, confirming that the errors track the best-approximation bounds rather than any optimization tolerance. revision: yes
Circularity Check
No significant circularity; derivation relies on standard quasilinearization and Newton-Kantorovich theory
full rationale
The paper applies the established Bellman-Kalaba quasilinearization to convert nonlinear PDEs into sequences of linear subproblems, then discretizes each via collocation on a Linear-in-Learnables (LiL) trial space and solves with direct QR least-squares. The local convergence claim invokes the standard Newton-Kantorovich theorem under an explicit smallness condition on the initial residual; this is an external mathematical result, not derived from or equivalent to the paper's own fitted parameters, trial-space choices, or benchmark outputs. No self-definitional loops, fitted inputs renamed as predictions, or load-bearing self-citations appear in the derivation chain. The seven benchmarks serve as independent numerical validation rather than inputs that force the claimed results by construction.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Local Newton-Kantorovich convergence holds under an explicit smallness condition on the initial residual or guess
invented entities (1)
-
Linear-in-Learnables (LiL) trial space
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Petroleum Reservoir Simulation
Aziz, K., Settari, A., 1979. Petroleum Reservoir Simulation. Applied Science Publishers, London. doi:10.2118/9781613999646
-
[2]
Quasilinearization and Nonlinear Boundary-Value Problems
Bellman, R., Kalaba, R., 1965. Quasilinearization and Nonlinear Boundary-Value Problems. American Elsevier, New York. doi:10.2307/3612757
-
[3]
The calculation of linear least squares problems
Bjorck, A., 2004. The calculation of linear least squares problems. Acta Numerica 13, 1–53. doi:10. 1017/S0962492904000169
2004
-
[4]
Chebyshev and Fourier Spectral Methods
Boyd, J.P., 2001. Chebyshev and Fourier Spectral Methods. 2nd ed., Dover Publications, Mineola, NY
2001
-
[5]
Mechanism of fluid displacement in sands
Buckley, S.E., Leverett, M.C., 1942. Mechanism of fluid displacement in sands. Transactions of the AIME 146, 107–116. doi:10.2118/942107-G
-
[6]
Calabrò, F., Fabiani, G., Siettos, C., 2021. Extreme learning machine collocation for the numerical solution of elliptic pdes with sharp gradients. Computer Methods in Applied Mechanics and Engineering 387, 114188. doi:10.1016/j.cma.2021.114188
-
[7]
Spectral Methods: Funda- mentals in Single Domains
Canuto, C.G., Hussaini, M.Y., Quarteroni, A., Zang, T.A., 2006. Spectral Methods: Funda- mentals in Single Domains. Scientific Computation, Springer, Berlin, Heidelberg. doi:10.1007/ 978-3-540-30726-6
2006
-
[8]
Tenth spe comparative solution project: A comparison of upscaling techniques
Christie, M.A., Blunt, M.J., 2001. Tenth spe comparative solution project: A comparison of upscaling techniques. SPE Reservoir Evaluation and Engineering 4, 308–317. doi:10.2118/72469-PA
-
[9]
Scientific machine learning through physics-informed neural networks: Where we are and what’s next
Cuomo, S., Cola, V.S.D., Giampaolo, F., Rozza, G., Raissi, M., Piccialli, F., 2022. Scientific machine learning through physics-informed neural networks: Where we are and what’s next. Journal of Scientific Computing 92, 88. doi:10.1007/s10915-022-01939-z
-
[10]
Dembo, R.S., Eisenstat, S.C., Steihaug, T., 1982. Inexact newton methods. SIAM Journal on Numerical Analysis 19, 400–408. doi:10.1137/0719025
-
[11]
Newton Methods for Nonlinear Problems: Affine Invariance and Adaptive Algo- rithms
Deuflhard, P., 2011. Newton Methods for Nonlinear Problems: Affine Invariance and Adaptive Algo- rithms. 2nd ed., Springer, Berlin. doi:10.1007/978-3-642-23899-4
-
[12]
Dong, S., Li, Z., 2021. Local extreme learning machines and domain decomposition for solving linear and nonlinear partial differential equations. Computer Methods in Applied Mechanics and Engineering 387, 114129. doi:10.1016/j.cma.2021.114129
-
[13]
Physics informed extreme learning machine (PIELM) – a rapid methodforthenumericalsolutionofpartialdifferentialequations
Dwivedi, V., Srinivasan, B., 2020. Physics informed extreme learning machine (PIELM) – a rapid methodforthenumericalsolutionofpartialdifferentialequations. Neurocomputing391, 96–118. doi:10. 1016/j.neucom.2019.12.099
2020
-
[14]
Fraces, C.G., Tchelepi, H.A., 2021. Physics informed deep learning for flow and transport in porous media, in: SPE Reservoir Simulation Conference, Society of Petroleum Engineers. p. D011S006R002. doi:10.2118/203934-MS. sPE-203934-MS
-
[15]
Limitations of physics informed machine learning for nonlinear two- phase transport in porous media
Fuks, O., Tchelepi, H.A., 2020. Limitations of physics informed machine learning for nonlinear two- phase transport in porous media. Journal of Machine Learning for Modeling and Computing 1, 19–37. doi:10.1615/JMachLearnModelComput.2020033905
-
[16]
Deep Learning
Goodfellow, I., Bengio, Y., Courville, A., 2016. Deep Learning. MIT Press, Cambridge, MA. URL: http://www.deeplearningbook.org. 43
2016
-
[17]
Can physics-informed neural networks beat the finite element method? IMA Journal of Applied Mathematics 89, 143–174
Grossmann, T.G., Komorowska, U.J., Latz, J., Schönlieb, C.B., 2024. Can physics-informed neural networks beat the finite element method? IMA Journal of Applied Mathematics 89, 143–174. doi:10. 1093/imamat/hxae011
2024
-
[18]
Gu, L., Qin, S., Xu, L., Chen, R., 2024. Physics-informed neural networks with domain decomposition for the incompressible navier-stokes equations. Physics of Fluids 36, 021914. doi:10.1063/5.0188830
-
[19]
A physics-informed deep learning framework for inversion and surrogate modeling in solid mechanics
Haghighat, E., Raissi, M., Moure, A., Gomez, H., Juanes, R., 2021. A physics-informed deep learning framework for inversion and surrogate modeling in solid mechanics. Computer Methods in Applied Mechanics and Engineering 379, 113741. doi:10.1016/j.cma.2021.113741
-
[20]
Newton informed neural operator for solving nonlinear partial differential equations, in: Advances in Neural Information Processing Systems (NeurIPS), pp
Hao, W., Liu, X., Yang, Y., 2024. Newton informed neural operator for solving nonlinear partial differential equations, in: Advances in Neural Information Processing Systems (NeurIPS), pp. 120832– 120860. URL:https://pmc.ncbi.nlm.nih.gov/articles/PMC11973962/
2024
-
[21]
Extreme learning machine: Theory and applications
Huang, G.B., Zhu, Q.Y., Siew, C.K., 2006. Extreme learning machine: Theory and applications. Neurocomputing 70, 489–501. doi:10.1016/j.neucom.2005.12.126
-
[22]
Huang, Y., Hao, W., Lin, G., 2022. HomPINNs: Homotopy physics-informed neural networks for learning multiple solutions of nonlinear elliptic differential equations. Computers and Mathematics with Applications 121, 62–73. doi:10.1016/j.camwa.2022.07.002
-
[23]
Jagtap, A.D., Kharazmi, E., Karniadakis, G.E., 2020. Conservative physics-informed neural networks on discrete domains for conservation laws: Applications to forward and inverse problems. Computer Methods in Applied Mechanics and Engineering 365, 113028. doi:10.1016/j.cma.2020.113028
-
[24]
Jin, X., Cai, S., Li, H., Karniadakis, G.E., 2021. NSFnets (navier-stokes flow nets): Physics-informed neural networks for the incompressible navier-stokes equations. Journal of Computational Physics 426, 109951. doi:10.1016/j.jcp.2020.109951
-
[25]
Kantorovich, L.V., Akilov, G.P., 1982. Functional Analysis. Elsevier. doi:10.1016/C2013-0-03044-7
-
[26]
Nature Reviews Physics3(6), 422–440 (2021) https://doi
Karniadakis, G.E., Kevrekidis, I.G., Lu, L., Perdikaris, P., Wang, S., Yang, L., 2021. Physics-informed machine learning. Nature Reviews Physics 3, 422–440. doi:10.1038/s42254-021-00314-5
-
[27]
hp-VPINNs: Variational physics-informed neural networks with domain decomposition
Kharazmi, E., Zhang, Z., Karniadakis, G.E., 2021. hp-VPINNs: Variational physics-informed neural networks with domain decomposition. Computer Methods in Applied Mechanics and Engineering 374, 113547. doi:10.1016/j.cma.2020.113547
-
[28]
Krishnapriyan, A.S., Gholami, A., Zhe, S., Kirby, R.M., Mahoney, M.W., 2021. Characterizing possible failure modes in physics-informed neural networks, in: Advances in Neural Information Processing Systems, pp. 26548–26560. URL:https://arxiv.org/abs/2109.01050
arXiv 2021
-
[29]
Anti-derivatives approximator for enhancing physics-informed neural networks
Lee, J., 2024. Anti-derivatives approximator for enhancing physics-informed neural networks. Computer Methods in Applied Mechanics and Engineering 426, 117000. doi:10.1016/j.cma.2024.117000
-
[30]
LeVeque, R.J., 2007. Finite Difference Methods for Ordinary and Partial Differential Equations: Steady- State and Time-Dependent Problems. SIAM, Philadelphia. doi:10.1137/1.9780898717839
-
[31]
Nonlinear analysis of multiphase transport in porous media in the presence of viscous, buoyancy, and capillary forces
Li, B., Tchelepi, H.A., 2015. Nonlinear analysis of multiphase transport in porous media in the presence of viscous, buoyancy, and capillary forces. Journal of Computational Physics 297, 104–131. doi:10. 1016/j.jcp.2015.04.057
2015
-
[32]
Liu, J., Zheng, S., Song, X., Xu, D., 2024. Locally linearized physics-informed neural networks for riemann problems of hyperbolic conservation laws. Physics of Fluids 36, 116135. doi:10.1063/5. 0238865. 44
work page doi:10.1063/5 2024
-
[33]
DeepXDE: A deep learning library for solving differential equations
Lu, L., Meng, X., Mao, Z., Karniadakis, G.E., 2021. DeepXDE: A deep learning library for solving differential equations. SIAM Review 63, 208–228. doi:10.1137/19M1274067
-
[34]
Quasilinearization approach to nonlinear problems in physics with application to nonlinear odes
Mandelzweig, V.B., Tabakin, F., 2001. Quasilinearization approach to nonlinear problems in physics with application to nonlinear odes. Computer Physics Communications 141, 268–281. doi:10.1016/ S0010-4655(01)00415-5
2001
-
[36]
Mattey, R., Ghosh, S., 2022. A novel sequential method to train physics informed neural networks for allen cahn and cahn hilliard equations. Computer Methods in Applied Mechanics and Engineering 390, 114474. doi:10.1016/j.cma.2021.114474
-
[37]
Self-adaptive physics-informed neural networks using a soft attention mechanism
McClenny, L.D., Braga-Neto, U.M., 2023. Self-adaptive physics-informed neural networks using a soft attention mechanism. Journal of Computational Physics 474, 111722. doi:10.1016/j.jcp.2022. 111722
-
[38]
Weak baselines and reporting biases lead to overoptimism in machine learning for fluid mechanics
McGreivy, N., Hakim, A., 2024. Weak baselines and reporting biases lead to overoptimism in machine learning for fluid mechanics. Nature Machine Intelligence 6, 1177–1189. doi:10.1038/ s42256-024-00897-5
2024
-
[39]
Raissi, M., Perdikaris, P., Karniadakis, G.E., 2019. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. Journal of Computational Physics 378, 686–707. doi:10.1016/j.jcp.2018.10.045
-
[40]
Rathore, P., Lei, W., Frangella, Z., Lu, L., Udell, M., 2024. Challenges in training pinns: A loss land- scape perspective, in: Proceedings of the 41st International Conference on Machine Learning (ICML), pp. 40828–40848. doi:10.5555/3692070.3693785
-
[41]
Exact enforcement of temporal continuity in sequential physics- informed neural networks
Roy, P., Castonguay, S.T., 2024. Exact enforcement of temporal continuity in sequential physics- informed neural networks. Computer Methods in Applied Mechanics and Engineering 430, 117197. doi:10.1016/j.cma.2024.117197
-
[42]
Numerical analysis of physics-informed neural networks and re- lated models in physics-informed machine learning
Ryck, T.D., Mishra, S., 2024. Numerical analysis of physics-informed neural networks and re- lated models in physics-informed machine learning. Acta Numerica 33, 633–713. doi:10.1017/ S0962492923000089
2024
-
[43]
Sparse linear least-squares problems
Scott, J., Tuma, M., 2025. Sparse linear least-squares problems. Acta Numerica 34, 891–1010. doi:10. 1017/S0962492924000059
2025
-
[44]
Localized linear systems for fully implicit simulation of multiphase multicomponent flow in porous media
Sheth, S., Moncorgé, A., Younis, R., 2020. Localized linear systems for fully implicit simulation of multiphase multicomponent flow in porous media. Computational Geosciences 24, 743–759. doi:10. 1007/s10596-019-09840-9
2020
-
[45]
Localized linear systems in sequential implicit simulation of two-phase flow and transport
Sheth, S.M., Younis, R.M., 2017. Localized linear systems in sequential implicit simulation of two-phase flow and transport. SPE Journal 22, 1542–1569. doi:10.2118/173320-PA
-
[46]
Shin, Y., Darbon, J., Karniadakis, G.E., 2020. On the convergence of physics informed neural networks for linear second-order elliptic and parabolic type pdes. Communications in Computational Physics 28, 2042–2074. doi:10.4208/cicp.OA-2020-0193
-
[47]
An Analysis of the Finite Element Method
Strang, G., Fix, G.J., 1973. An Analysis of the Finite Element Method. Prentice-Hall, Englewood Cliffs, NJ. doi:10.1137/1.9780980232707. 45
-
[48]
Tancik, M., Srinivasan, P.P., Mildenhall, B., Fridovich-Keil, S., Raghavan, N., Singhal, U., Ramamoor- thi, R., Barron, J.T., Ng, R., 2020. Fourier features let networks learn high frequency functions in low dimensional domains, in: Advances in Neural Information Processing Systems, pp. 7537–7547. doi:10.5555/3495724.3496356
-
[49]
Trefethen, L.N., 2000. Spectral Methods in MATLAB. Society for Industrial and Applied Mathematics. doi:10.1137/1.9780898719598
-
[50]
Wang, S., Sankaran, S., Perdikaris, P., 2024. Respecting causality for training physics-informed neural networks. Computer Methods in Applied Mechanics and Engineering 421, 116813. doi:10.1016/j.cma. 2024.116813
-
[51]
Understanding and mitigating gradient flow pathologies in physics-informed neural networks
Wang, S., Teng, Y., Perdikaris, P., 2021a. Understanding and mitigating gradient flow pathologies in physics-informed neural networks. SIAM Journal on Scientific Computing 43, A3055–A3081. doi:10. 1137/20M1318043
-
[52]
Wang, S., Wang, H., Perdikaris, P., 2021b. On the eigenvector bias of fourier feature networks: From regression to solving multi-scale pdes with physics-informed neural networks. Computer Methods in Applied Mechanics and Engineering 384, 113938. doi:10.1016/j.cma.2021.113938
-
[53]
Wang, S., Yu, X., Perdikaris, P., 2022. When and why pinns fail to train: A neural tangent kernel perspective. Journal of Computational Physics 449, 110768. doi:10.1016/j.jcp.2021.110768
-
[54]
Adaptively localized continuation-newton method—nonlinear solvers that converge all the time
Younis, R.M., Tchelepi, H.A., Aziz, K., 2009. Adaptively localized continuation-newton method—nonlinear solvers that converge all the time. SPE Journal 15, 526–544. doi:10.2118/ 119147-PA
2009
-
[55]
Gradient-enhanced physics-informed neural networks for forward and inverse pde problems
Yu, J., Lu, L., Meng, X., Karniadakis, G.E., 2022. Gradient-enhanced physics-informed neural networks for forward and inverse pde problems. Computer Methods in Applied Mechanics and Engineering 393, 114823. doi:10.1016/j.cma.2022.114823. 46
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.