pith. sign in

hub Canonical reference

DeepONet: Learning nonlinear operators for identifying differential equations based on the universal approximation theorem of operators

Canonical reference. 77% of citing Pith papers cite this work as background.

40 Pith papers citing it
Background 77% of classified citations
abstract

While it is widely known that neural networks are universal approximators of continuous functions, a less known and perhaps more powerful result is that a neural network with a single hidden layer can approximate accurately any nonlinear continuous operator. This universal approximation theorem is suggestive of the potential application of neural networks in learning nonlinear operators from data. However, the theorem guarantees only a small approximation error for a sufficient large network, and does not consider the important optimization and generalization errors. To realize this theorem in practice, we propose deep operator networks (DeepONets) to learn operators accurately and efficiently from a relatively small dataset. A DeepONet consists of two sub-networks, one for encoding the input function at a fixed number of sensors $x_i, i=1,\dots,m$ (branch net), and another for encoding the locations for the output functions (trunk net). We perform systematic simulations for identifying two types of operators, i.e., dynamic systems and partial differential equations, and demonstrate that DeepONet significantly reduces the generalization error compared to the fully-connected networks. We also derive theoretically the dependence of the approximation error in terms of the number of sensors (where the input function is defined) as well as the input function type, and we verify the theorem with computational results. More importantly, we observe high-order error convergence in our computational tests, namely polynomial rates (from half order to fourth order) and even exponential convergence with respect to the training dataset size.

hub tools

citation-role summary

background 11 method 2

citation-polarity summary

clear filters

representative citing papers

Universal Approximation of Nonlinear Operators and Their Derivatives

cs.LG · 2026-05-14 · unverdicted · novelty 8.0

Proves the first universal approximation theorems for k-times differentiable nonlinear operators between Banach spaces and their derivatives uniformly on compact sets in weighted Sobolev norms via encoder-decoder operator learning architectures.

CATO: Charted Attention for Neural PDE Operators

cs.AI · 2026-05-09 · unverdicted · novelty 7.0

CATO learns a continuous latent chart for efficient axial attention on PDE meshes and adds derivative-aware supervision to improve accuracy and reduce oversmoothing on general geometries.

Physics-Informed Neural PDE Solvers via Spatio-Temporal MeanFlow

cs.LG · 2026-05-09 · unverdicted · novelty 7.0

Spatio-Temporal MeanFlow adapts MeanFlow to PDEs by replacing the generative velocity field with the physical operator and extending the integral constraint to the spatio-temporal domain, yielding a unified solver for time-dependent and stationary equations with improved accuracy and generalization.

AI models of unstable flow exhibit hallucination

physics.flu-dyn · 2026-04-22 · unverdicted · novelty 7.0

AI models of viscous fingering exhibit hallucinations from spectral bias; DeepFingers combines FNO and DeepONet with time-contrast conditioning to predict accurate finger dynamics while preserving mixing metrics.

DeepRitzSplit Neural Operator for Phase-Field Models via Energy Splitting

math.AP · 2026-04-20 · unverdicted · novelty 7.0

A DeepRitzSplit neural operator trained on energy-split variational forms enforces dissipation in phase-field models and outperforms data-driven training in generalization while running faster than Fourier spectral methods on Allen-Cahn and dendritic growth cases.

Is Flow Matching Just Trajectory Replay for Sequential Data?

stat.ML · 2026-02-09 · unverdicted · novelty 7.0

Flow matching on time series targets a closed-form nonparametric velocity field that is a similarity-weighted mixture of observed transition velocities, making neural models approximations to an ideal memory-augmented dynamical system sampler.

Continuity Laws for Sequential Models

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

S4 models exhibit stable time-continuity unlike sensitive S6 models, with task continuity predicting performance and enabling temporal subsampling for better efficiency.

Deep Learning for Subspace Regression

cs.LG · 2025-09-27 · unverdicted · novelty 6.0

Neural networks regress oversized subspaces for parametric problems using subspace-specific losses, with theory and experiments showing improved accuracy and smoother mappings.

citing papers explorer

Showing 2 of 2 citing papers after filters.