Random test functions, $H^{-1}$ norm equivalence, and stochastic variational physics-informed neural networks

Diego Marcondes

arxiv: 2605.03542 · v2 · pith:WW65ATP3new · submitted 2026-05-05 · 🧮 math.NA · cs.LG· cs.NA

Random test functions, H⁻¹ norm equivalence, and stochastic variational physics-informed neural networks

Diego Marcondes This is my paper

Pith reviewed 2026-05-07 14:49 UTC · model grok-4.3

classification 🧮 math.NA cs.LGcs.NA

keywords H^{-1} norm equivalencerandom test functionsstochastic variational PINNsweak solutionsphysics-informed neural networkselliptic PDEsSobolev spaces

0 comments

The pith

The H^{-1} norm of any functional is equivalent to its expected squared evaluation against a random test function whose distribution depends only on the domain.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proves that the H^{-1} norm of any functional can be recovered exactly by taking the expectation of its squared evaluation on samples from a random test function distribution that depends solely on the domain. This holds even though the test functions have negative Sobolev regularity in dimensions two and higher. The equivalence defines stochastically weak solutions that are the same as classical weak solutions. It leads to a new class of neural network methods for solving PDEs by minimizing an empirical stochastic norm of the residual instead of a deterministic one.

Core claim

The central discovery is that the H^{-1} norm of any functional is equivalent to its expected squared evaluation against a random test function whose distribution depends only on the domain. Realisations of this random test function have negative Sobolev regularity for d greater than or equal to 2, yet this roughness does not prevent averaging over the distribution from exactly recovering the correct weak topology, independently of the differential operator. This equivalence introduces stochastically weak solutions, which coincide with classical weak solutions, and motivates stochastic variational physics-informed neural networks trained by minimising an empirical approximation of the stocha

What carries the argument

A domain-dependent probability distribution over test functions such that the expected value of the squared pairing with any functional recovers the H^{-1} norm and the associated weak topology.

Load-bearing premise

There exists a distribution over random test functions, depending only on the domain, for which the expected squared evaluation exactly equals the H^{-1} norm of the functional for any functional.

What would settle it

A concrete counterexample consisting of a domain, a specific functional, and its H^{-1} norm that does not match the computed expectation over samples from the proposed random test function distribution.

Figures

Figures reproduced from arXiv: 2605.03542 by Diego Marcondes.

**Figure 1.** Figure 1: Examples of realisations of Φ. (a) Sampled by (4.10) with τ “ 10 for d “ 1, 2, 3 in a grid with, respectively, 1024, 10242 and 5123 points. For d “ 3 we present the slice φp0.5, ¨, ¨q of the sampled realisation. (b) Sampled via the truncated sum (4.12) with M “ 4, 096, τ “ 0.1 and 22, 500 points sampled uniformly. (c) Sampled by truncating the sum (4.13) with M “ 1, 024, τ “ 0.1 and 7, 105 points in a grid… view at source ↗

**Figure 2.** Figure 2: (a) Solutions of the boundary value problems in Experiment 1 (6.3) (solid line in red) with the approximation by the SV-PINNs with DAFF trained with L-BFGS for 5, 000 steps (dashed line in blue), and the pointwise error between them. (b) Average L 2 relative error with ˘ one standard error over the 3 repetitions at all training steps for each method, optimiser and architecture features for solving the boun… view at source ↗

**Figure 3.** Figure 3: Solution of the boundary value problems in Experiment 2 (6.4), the approximation by an SV-PINN trained with L-BFGS for 5, 000 steps and its pointwise error view at source ↗

**Figure 4.** Figure 4: Average L 2 relative error with ˘ one standard error over the 3 repetitions at all training steps for each method and optimiser for solving the boundary value problems in Experiment 2 (6.4). training steps. The performance of SV-PINNs was more stable, i.e., smaller standard deviations, over the three repetitions than that of PINNs. The SV-PINNs converged with significantly fewer steps than PINNs, attaini… view at source ↗

**Figure 5.** Figure 5: (a) Solution of the boundary value problem in Experiment 3 (6.5), the approximation by an SV-PINN trained with L-BFGS for 5, 000 steps and its pointwise error. (b) Average L 2 relative error with ˘ one standard error over the 3 repetitions at all training steps for each method and optimiser. 6.4. Experiment 4: Helmholtz equation in 2D. To further increase the complexity, we consider the two-dimensional Hel… view at source ↗

**Figure 6.** Figure 6: (a) Solution of the boundary value problem in Experiment 4 (6.6), the approximation by an SV-PINN trained with L-BFGS for 5, 000 steps and its pointwise error. (b) Average L 2 relative error with ˘ one standard error over the 3 repetitions at all training steps for each method and optimiser. As in the previous experiments, the SV-PINNs trained with L-BFGS were able to recover the solution with an L 2 relat… view at source ↗

**Figure 7.** Figure 7: Solution of the boundary value problems in Experiment 5 (6.7), the approximation by an SV-PINN trained with L-BFGS for 5, 000 steps and its pointwise error view at source ↗

**Figure 8.** Figure 8: Average L 2 relative error with ˘ one standard error over the 3 repetitions at all training steps for each method and optimiser for solving the boundary value problems in Experiment 5 (6.7). these frequencies are more compatible with the solution for higher values of k. As with the other examples, the convergence of the SV-PINNs trained by L-BFGS is significantly faster, achieving an L 2 relative error lo… view at source ↗

**Figure 9.** Figure 9: Solution of the boundary value problems in Experiment 6 (6.8), the approximation by an SV-PINN trained with L-BFGS for 5, 000 steps and its pointwise error. We consider a slice of the respective functions with the first variable x “ 0.5. As expected, the PINNs perform particularly well, especially for higher values of k, attaining L 2 relative errors on average between 6.7 ˆ 10´3 and 8.5 ˆ 10´3 , outperfo… view at source ↗

**Figure 10.** Figure 10: Average L 2 relative error with ˘ one standard error over the 3 repetitions at all training steps for each method and optimiser for solving the boundary value problems in Experiment 6 (6.8) view at source ↗

**Figure 11.** Figure 11: Solution of the boundary value problems in Experiment 7 (7.1), the approximation by an SV-PINN trained with L-BFGS for 5, 000 steps and its pointwise error. We consider an architecture with 64 DAFF and one hidden layer with 1, 024 nodes, τ “ 0.1 for numerical stability, the collocation points in a grid with 7, 105 points, and the same training methods as in the previous sections. In order to compute the … view at source ↗

**Figure 12.** Figure 12: Average L 2 relative error with ˘ one standard error over the 3 repetitions at all training steps for each method and optimiser for solving the boundary value problems in Experiment 7 (7.1) view at source ↗

**Figure 13.** Figure 13: Solution of the boundary value problems in Experiment 8 (8.1), the approximation by an SV-PINN trained with L-BFGS for 5, 000 steps and its pointwise error. norm applicable to broader classes of PDEs. We emphasise that the following extensions are presented at a conceptual level and detailed analytical and numerical investigations are left for future work. 9.1. High-order elliptic operators and poly-har… view at source ↗

**Figure 14.** Figure 14: Average L 2 relative error with ˘ one standard error over the 3 repetitions at all training steps for each method and optimiser for solving the boundary value problems in Experiment 8 (8.1). in which B iu Bν i is the i-th normal derivative. For instance, B 0u Bν 0 “ u|BΩ and B 1u Bν 1 “ ∇u ¨ ν in which ν is the normal vector. To characterise the weak solutions of (9.1), we consider the eigenvalues and ei… view at source ↗

read the original abstract

The dual norm characterisation of weak solutions of second-order linear elliptic partial differential equations is mathematically natural but computationally intractable: evaluating the $H^{-1}$ norm of the residual requires a supremum over an infinite-dimensional test space. We prove that the $H^{-1}$ norm of any functional is equivalent to its expected squared evaluation against a random test function whose probability distribution depends only on the domain. Crucially, realisations of this random test function have negative Sobolev regularity for $d \geq 2$, yet this roughness is not an obstacle: averaging over the distribution exactly recovers the correct weak topology, independently of the differential operator, and no supremum evaluation is necessary. This equivalence introduces the notion of stochastically weak solutions, which coincide with classical weak solutions, and motivates stochastic variational physics-informed neural networks (SV-PINNs): neural networks trained by minimising an empirical approximation of the stochastic norm of the PDE residual. Although instantiated here with neural networks, the underlying principle is independent of the trial space and suggests a broader paradigm for numerical methods based on stochastic rather than deterministic test spaces. The framework extends naturally to higher-order elliptic, parabolic and hyperbolic equations and to abstract operator equations on Hilbert spaces. As a proof of concept, we present numerical experiments on eight challenging second-order linear elliptic problems spanning high-frequency and multi-scale solutions, indefinite operators, variable coefficients, and non-standard domains, in which SV-PINNs consistently and significantly outperform standard PINNs, recovering solutions to within one percent relative error in hundreds of L-BFGS steps.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives a clean functional-analytic construction for recovering the H^{-1} norm from an expectation over random test functions whose distribution depends only on the domain, then uses it to train SV-PINNs that beat standard PINNs on eight elliptic test cases.

read the letter

The central new piece is the observation that the H^{-1} norm of any functional equals the square root of the expected squared pairing against a random test function whose covariance is the inverse Riesz map of the H^1_0 inner product on the domain. This makes the quadratic form match the dual norm exactly and keeps the topology right even when the sample paths are rough for d greater than or equal to 2. Because the distribution is fixed by the domain alone, the same random test functions work for any differential operator, which is the practical payoff. They turn this into stochastic variational PINNs by replacing the usual supremum in the residual norm with an empirical average over samples from that distribution. The numerical section tests the approach on eight second-order linear elliptic problems that include high-frequency solutions, multi-scale coefficients, indefinite operators, variable coefficients, and non-standard domains. In each case the SV-PINN reaches relative errors below one percent in a few hundred L-BFGS steps while standard PINNs lag behind. The math behind the norm equivalence looks solid once you see the Riesz-map construction; it is not a fitted surrogate but a direct consequence of the inner-product definition. The experiments supply concrete evidence that the stochastic loss is easier to optimize than the usual variational one. The main soft spot is that the paper does not report how many samples are drawn per gradient step or how the random functions are discretized, so it is hard to judge the variance of the estimator or its sensitivity to implementation choices. The broader claims about parabolic, hyperbolic, and abstract operator equations are stated but not demonstrated. This is useful reading for anyone working on variational formulations inside scientific machine learning or on tractable surrogates for dual norms in PDE solvers. It is worth sending to peer review because the core construction is straightforward to check and the reported gains are specific enough to be worth verifying.

Referee Report

0 major / 2 minor

Summary. The manuscript proves that the H^{-1} norm of any functional is equivalent to the square root of its expected squared pairing against a random test function whose law depends only on the domain. This equivalence is independent of the differential operator, yields stochastically weak solutions that coincide with classical weak solutions, and motivates SV-PINNs: neural networks trained by minimizing an empirical Monte-Carlo approximation to the stochastic residual norm. The framework is illustrated on eight second-order linear elliptic problems (high-frequency, multi-scale, indefinite, variable-coefficient, non-standard domains) where SV-PINNs recover solutions to within 1% relative error in a few hundred L-BFGS iterations and consistently outperform standard PINNs.

Significance. If the equivalence is rigorously established, the work supplies a computationally tractable, operator-independent surrogate for the dual norm that removes the need to solve auxiliary supremum problems. This is a genuine advance for variational PINNs and related mesh-free methods. The numerical evidence on a deliberately diverse test suite, together with the claim that negative Sobolev regularity of the random fields does not obstruct recovery of the H^{-1} topology, strengthens the practical case. The extension sketched to higher-order, parabolic, hyperbolic and abstract Hilbert-space problems further increases potential impact.

minor comments (2)

[Abstract] Abstract: the statement that solutions are recovered “to within one percent relative error” should specify the norm (L^2, H^1, etc.) in which the error is measured and indicate whether the figure is an average or worst-case value over the eight test problems.
[Numerical experiments] Implementation details for sampling the random test functions (discretization of the covariance operator, number of Monte-Carlo samples per loss evaluation, and handling of the negative-order regularity for d ≥ 2) are only sketched; a short algorithmic box or pseudocode would improve reproducibility.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive and encouraging report, which accurately summarizes the main theoretical result on the H^{-1} equivalence and its use in SV-PINNs, as well as the numerical experiments on diverse elliptic problems. We appreciate the assessment of significance and the recommendation for minor revision.

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper's central result is a functional-analytic existence proof: there exists a random test function distribution (depending only on the domain) such that the expected squared duality pairing recovers the H^{-1} norm exactly. This is achieved by taking the covariance operator of the random field to be the inverse Riesz map of the H^1_0 inner product, which is a standard, operator-independent construction on the domain. The equivalence therefore holds by the definition of covariance and the Riesz representation theorem, but this constitutes a valid mathematical demonstration rather than a self-referential loop or a fitted quantity renamed as a prediction. No load-bearing step in the derivation reduces to its own inputs by construction; the subsequent SV-PINN training procedure follows from the established equivalence without circularity. The argument is self-contained against external functional-analytic benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on standard functional-analysis properties of Sobolev spaces together with the paper-specific assertion that a domain-dependent random distribution recovers the dual norm via expectation; no free parameters or new physical entities are introduced.

axioms (2)

standard math Standard properties of Sobolev spaces H^1 and its dual H^{-1} for second-order elliptic operators
Invoked to define weak solutions and the dual-norm characterisation of residuals.
ad hoc to paper Existence of a random test-function distribution depending only on the domain such that expectation of squared evaluations recovers the H^{-1} norm independently of the operator
This is the load-bearing statement proved in the paper and required for the stochastic formulation to be equivalent to the classical weak form.

pith-pipeline@v0.9.0 · 5579 in / 1511 out tokens · 158275 ms · 2026-05-07T14:49:20.046692+00:00 · methodology

Review history (2 revisions) →

Random test functions, H⁻¹ norm equivalence, and stochastic variational physics-informed neural networks

Core claim

What carries the argument

Load-bearing premise

What would settle it

discussion (0)