hub Canonical reference

DeepONet: Learning nonlinear operators for identifying differential equations based on the universal approximation theorem of operators

Lu Lu, Pengzhan Jin, George Em Karniadakis · 2019 · cs.LG · arXiv 1910.03193

Canonical reference. 77% of citing Pith papers cite this work as background.

40 Pith papers citing it

Background 77% of classified citations

open full Pith review browse 40 citing papers arXiv PDF

abstract

While it is widely known that neural networks are universal approximators of continuous functions, a less known and perhaps more powerful result is that a neural network with a single hidden layer can approximate accurately any nonlinear continuous operator. This universal approximation theorem is suggestive of the potential application of neural networks in learning nonlinear operators from data. However, the theorem guarantees only a small approximation error for a sufficient large network, and does not consider the important optimization and generalization errors. To realize this theorem in practice, we propose deep operator networks (DeepONets) to learn operators accurately and efficiently from a relatively small dataset. A DeepONet consists of two sub-networks, one for encoding the input function at a fixed number of sensors $x_i, i=1,\dots,m$ (branch net), and another for encoding the locations for the output functions (trunk net). We perform systematic simulations for identifying two types of operators, i.e., dynamic systems and partial differential equations, and demonstrate that DeepONet significantly reduces the generalization error compared to the fully-connected networks. We also derive theoretically the dependence of the approximation error in terms of the number of sensors (where the input function is defined) as well as the input function type, and we verify the theorem with computational results. More importantly, we observe high-order error convergence in our computational tests, namely polynomial rates (from half order to fourth order) and even exponential convergence with respect to the training dataset size.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 11 method 2

citation-polarity summary

background 10 use method 2 unclear 1

representative citing papers

Universal Approximation of Nonlinear Operators and Their Derivatives

cs.LG · 2026-05-14 · unverdicted · novelty 8.0

Proves the first universal approximation theorems for k-times differentiable nonlinear operators between Banach spaces and their derivatives uniformly on compact sets in weighted Sobolev norms via encoder-decoder operator learning architectures.

Constraint-Aware Flow Matching: Decision Aligned End-to-End Training for Constrained Sampling

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

Constraint-Aware Flow Matching integrates constraint projections into the flow matching training objective to align model dynamics with constrained sampling and reduce distributional shift.

Approximation of Maximally Monotone Operators : A Graph Convergence Perspective

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

Any maximally monotone operator can be approximated in local graph convergence by continuous encoder-decoder networks, with structure-preserving versions that retain maximal monotonicity via resolvent parameterizations.

Stable Long-Horizon PDE Forecasting via Latent Structured Spectral Propagators

cs.LG · 2026-05-11 · unverdicted · novelty 7.0

A latent Structured Spectral Propagator enables stable autoregressive PDE forecasting by decoupling spatial details from recurrent modal dynamics.

CATO: Charted Attention for Neural PDE Operators

cs.AI · 2026-05-09 · unverdicted · novelty 7.0

CATO learns a continuous latent chart for efficient axial attention on PDE meshes and adds derivative-aware supervision to improve accuracy and reduce oversmoothing on general geometries.

Physics-Informed Neural PDE Solvers via Spatio-Temporal MeanFlow

cs.LG · 2026-05-09 · unverdicted · novelty 7.0

Spatio-Temporal MeanFlow adapts MeanFlow to PDEs by replacing the generative velocity field with the physical operator and extending the integral constraint to the spatio-temporal domain, yielding a unified solver for time-dependent and stationary equations with improved accuracy and generalization.

Geometry-Aware Neural Optimizer for Shape Optimization and Inversion

cs.LG · 2026-05-06 · conditional · novelty 7.0 · 3 refs

GANO is an end-to-end differentiable latent-space optimizer that unifies shape encoding, surrogate prediction, and controllable geometry updates for PDE-governed shape optimization and inversion.

AI models of unstable flow exhibit hallucination

physics.flu-dyn · 2026-04-22 · unverdicted · novelty 7.0

AI models of viscous fingering exhibit hallucinations from spectral bias; DeepFingers combines FNO and DeepONet with time-contrast conditioning to predict accurate finger dynamics while preserving mixing metrics.

DeepRitzSplit Neural Operator for Phase-Field Models via Energy Splitting

math.AP · 2026-04-20 · unverdicted · novelty 7.0

A DeepRitzSplit neural operator trained on energy-split variational forms enforces dissipation in phase-field models and outperforms data-driven training in generalization while running faster than Fourier spectral methods on Allen-Cahn and dendritic growth cases.

DiLO: Decoupling Generative Priors and Neural Operators via Diffusion Latent Optimization for Inverse Problems

math.NA · 2026-04-13 · unverdicted · novelty 7.0

DiLO turns diffusion sampling into deterministic latent optimization to satisfy the manifold consistency requirement for neural operators in inverse problem solving.

Is Flow Matching Just Trajectory Replay for Sequential Data?

stat.ML · 2026-02-09 · unverdicted · novelty 7.0

Flow matching on time series targets a closed-form nonparametric velocity field that is a similarity-weighted mixture of observed transition velocities, making neural models approximations to an ideal memory-augmented dynamical system sampler.

CompNO: A Novel Foundation Model approach for solving Partial Differential Equations

cs.LG · 2026-01-12 · unverdicted · novelty 7.0

CompNO composes specialized Fourier neural operator blocks for fundamental differential operators into task-specific solvers that achieve lower L2 error than baselines on linear parametric PDEs and remain competitive on nonlinear flows while exactly satisfying boundaries.

Universal Approximation of Operators with Transformers and Neural Integral Operators

cs.LG · 2024-09-01 · unverdicted · novelty 7.0

Transformers and generalized neural integral operators are shown to universally approximate operators between Hölder and Banach spaces.

Symplectic Neural Operators for Learning Infinite Dimensional Hamiltonian Systems

math.DS · 2026-05-15 · unverdicted · novelty 6.0

Symplectic Neural Operators preserve symplectic structure for learning infinite-dimensional Hamiltonian PDEs and deliver improved long-term energy stability in theory and experiments.

Compositional Neural Operators for Multi-Dimensional Fluid Dynamics

cs.LG · 2026-05-12 · unverdicted · novelty 6.0

Compositional Neural Operators decompose multi-dimensional fluid PDEs into a library of pretrained elementary physics blocks assembled via an aggregator that minimizes data and physics residuals.

Don't Fix the Basis -- Learn It: Spectral Representation with Adaptive Basis Learning for PDEs

cs.LG · 2026-05-11 · unverdicted · novelty 6.0

ABLE learns a spatially adaptive Parseval frame from data via an ancillary density to replace fixed bases in spectral neural operators for PDEs.

Continuity Laws for Sequential Models

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

S4 models exhibit stable time-continuity unlike sensitive S6 models, with task continuity predicting performance and enabling temporal subsampling for better efficiency.

Hierarchical Multi-Fidelity Learning for Predicting Three-Dimensional Flame Wrinkling and Turbulent Burning Velocity

cs.LG · 2026-05-06 · unverdicted · novelty 6.0

MuFiNNs integrates sparse experimental measurements with structured low-fidelity models via hierarchical construction and nonlinear correction to predict 3D flame wrinkling dynamics and turbulent mass burning velocity across fuels, pressures, and turbulence levels.

Late Fusion Neural Operators for Extrapolation Across Parameter Space in Partial Differential Equations

cs.LG · 2026-04-17 · unverdicted · novelty 6.0

Late Fusion Neural Operators disentangle state and parameter learning to outperform FNO and CAPE-FNO on advection, Burgers, and reaction-diffusion PDEs with 72% average RMSE reduction in and out of domain.

Hyperfastrl: Hypernetwork-based reinforcement learning for unified control of parametric chaotic PDEs

cs.CE · 2026-04-07 · unverdicted · novelty 6.0

Hypernetworks map a forcing parameter directly to policy weights in an RL framework, enabling unified stabilization of the Kuramoto-Sivashinsky equation across regimes with KAN architectures showing strongest extrapolation.

Certified and accurate computation of function space norms of deep neural networks

math.NA · 2026-03-06 · unverdicted · novelty 6.0

A certified adaptive quadrature framework computes guaranteed L^p, W^{1,p}, and W^{2,p} norms of deep neural networks by propagating interval enclosures on axis-aligned boxes.

Generalized Spherical Neural Operators: Green's Function Formulation

cs.LG · 2025-12-11 · unverdicted · novelty 6.0

GSNO uses position-dependent spherical Green's functions to create flexible neural operators that adapt to non-equivariant systems on spheres while keeping spectral efficiency and grid invariance.

Differentiable Autoencoding Neural Operator for Interpretable and Integrable Latent Space Modeling

cs.LG · 2025-09-30 · unverdicted · novelty 6.0

DIANO builds coarse-grid latent spaces for fluid dynamics data via neural operator encoding and decoding while integrating a differentiable PDE solver directly in the latent space for end-to-end physics-constrained training.

Deep Learning for Subspace Regression

cs.LG · 2025-09-27 · unverdicted · novelty 6.0

Neural networks regress oversized subspaces for parametric problems using subspace-specific losses, with theory and experiments showing improved accuracy and smoother mappings.

citing papers explorer

Showing 10 of 10 citing papers after filters.

Generalized Spherical Neural Operators: Green's Function Formulation cs.LG · 2025-12-11 · unverdicted · none · ref 11 · internal anchor
GSNO uses position-dependent spherical Green's functions to create flexible neural operators that adapt to non-equivariant systems on spheres while keeping spectral efficiency and grid invariance.
Differentiable Autoencoding Neural Operator for Interpretable and Integrable Latent Space Modeling cs.LG · 2025-09-30 · unverdicted · none · ref 22 · internal anchor
DIANO builds coarse-grid latent spaces for fluid dynamics data via neural operator encoding and decoding while integrating a differentiable PDE solver directly in the latent space for end-to-end physics-constrained training.
Deep Learning for Subspace Regression cs.LG · 2025-09-27 · unverdicted · none · ref 22 · internal anchor
Neural networks regress oversized subspaces for parametric problems using subspace-specific losses, with theory and experiments showing improved accuracy and smoother mappings.
Latent Space Dynamics Identification for Interface Tracking with Application to Shock-Induced Pore Collapse physics.comp-ph · 2025-07-14 · unverdicted · none · ref 38 · internal anchor
LaSDI-IT learns latent linear dynamics for interface tracking via a revised autoencoder and Gaussian process interpolation, achieving under 9% error and 106x speedup on shock-induced pore collapse in high explosives.
Operator Learning for Schr\"{o}dinger Equation: Unitarity, Error Bounds, and Time Generalization stat.ML · 2025-05-23 · unverdicted · none · ref 4 · internal anchor
A linear estimator for the Schrödinger evolution operator is introduced that enforces weak unitarity, supplies uniform prediction error bounds and time-extrapolation bounds, and reports up to 100x lower relative error than FNO and DeepONet on hydrogen, ion-trap, and optical-lattice Hamiltonians.
On the definition and importance of interpretability in scientific machine learning cs.LG · 2025-05-16 · conditional · none · ref 45 · internal anchor
Interpretability in SciML requires mechanistic understanding rather than sparsity, and prior knowledge is often essential for interpretable scientific discovery.
Teaching Artificial Intelligence to Perform Rapid, Resolution-Invariant Grain Growth Modeling via Fourier Neural Operator cond-mat.mtrl-sci · 2025-03-18 · unverdicted · none · ref 45 · internal anchor
FNO surrogate model learns to predict long-term grain growth evolution from phase-field data while remaining accurate on unseen configurations and higher-resolution grids.
ATHENA: Agentic Team for Hierarchical Evolutionary Numerical Algorithms cs.LG · 2025-12-03 · unverdicted · none · ref 76 · internal anchor
ATHENA introduces an agentic team framework that autonomously manages the end-to-end computational research lifecycle via a knowledge-driven HENA loop to achieve validation errors of 10^{-14} in scientific computing and machine learning tasks.
XRePIT: A deep learning-computational fluid dynamics hybrid framework implemented in OpenFOAM for fast, robust, and scalable unsteady simulations cs.LG · 2025-10-21 · unverdicted · none · ref 26 · internal anchor
XRePIT automates residual-guided switching between neural surrogates and OpenFOAM to enable stable, up to 2.91x faster 3D unsteady flow simulations with L2 errors around 1E-03.
A Practitioner's Guide to Kolmogorov-Arnold Networks cs.LG · 2025-10-28 · accept · none · ref 156 · internal anchor
A systematic review of Kolmogorov-Arnold Networks that maps their relation to Kolmogorov superposition theory, MLPs, and kernels, examines basis-function design choices, summarizes performance advances, and supplies a practitioner's selection guide plus open challenges.

DeepONet: Learning nonlinear operators for identifying differential equations based on the universal approximation theorem of operators

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer