hub Canonical reference

DeepONet: Learning nonlinear operators for identifying differential equations based on the universal approximation theorem of operators

Lu Lu, Pengzhan Jin, George Em Karniadakis · 2019 · cs.LG · arXiv 1910.03193

Canonical reference. 77% of citing Pith papers cite this work as background.

44 Pith papers citing it

Background 77% of classified citations

open full Pith review browse 44 citing papers arXiv PDF

abstract

While it is widely known that neural networks are universal approximators of continuous functions, a less known and perhaps more powerful result is that a neural network with a single hidden layer can approximate accurately any nonlinear continuous operator. This universal approximation theorem is suggestive of the potential application of neural networks in learning nonlinear operators from data. However, the theorem guarantees only a small approximation error for a sufficient large network, and does not consider the important optimization and generalization errors. To realize this theorem in practice, we propose deep operator networks (DeepONets) to learn operators accurately and efficiently from a relatively small dataset. A DeepONet consists of two sub-networks, one for encoding the input function at a fixed number of sensors $x_i, i=1,\dots,m$ (branch net), and another for encoding the locations for the output functions (trunk net). We perform systematic simulations for identifying two types of operators, i.e., dynamic systems and partial differential equations, and demonstrate that DeepONet significantly reduces the generalization error compared to the fully-connected networks. We also derive theoretically the dependence of the approximation error in terms of the number of sensors (where the input function is defined) as well as the input function type, and we verify the theorem with computational results. More importantly, we observe high-order error convergence in our computational tests, namely polynomial rates (from half order to fourth order) and even exponential convergence with respect to the training dataset size.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 11 method 2

citation-polarity summary

background 10 use method 2 unclear 1

representative citing papers

Therm-FM: Foundation Model is ALL YOU NEED for 3D-ICs Thermal Simulation

cs.CE · 2026-05-21 · unverdicted · novelty 7.0 · 2 refs

Therm-FM adapts a pretrained PDE foundation model using thermal-equivalent multi-fidelity training to achieve up to 10.6x lower error in 3D-IC thermal simulation with under 20% of typical training data and strong cross-design transfer.

Universal Approximation of Nonlinear Operators and Their Derivatives

cs.LG · 2026-05-14 · unverdicted · novelty 7.0 · 2 refs

Proves first UATs for k-times differentiable nonlinear operators and their derivatives via OL architectures uniformly on compact sets in weighted Bastiani-Sobolev spaces on general Banach spaces.

Constraint-Aware Flow Matching: Decision Aligned End-to-End Training for Constrained Sampling

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

Constraint-Aware Flow Matching integrates constraint projections into the flow matching training objective to align model dynamics with constrained sampling and reduce distributional shift.

Approximation of Maximally Monotone Operators : A Graph Convergence Perspective

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

Any maximally monotone operator can be approximated in local graph convergence by continuous encoder-decoder networks, with structure-preserving versions that retain maximal monotonicity via resolvent parameterizations.

Implicit Neural Optimal Transport via Fixed-Point Optimization

math.OC · 2026-05-11 · unverdicted · novelty 7.0

A single-network implicit neural optimal transport method that solves the c-transform via proximal fixed-point iteration for stable, non-adversarial training.

Stable Long-Horizon PDE Forecasting via Latent Structured Spectral Propagators

cs.LG · 2026-05-11 · unverdicted · novelty 7.0

A latent Structured Spectral Propagator enables stable autoregressive PDE forecasting by decoupling spatial details from recurrent modal dynamics.

CATO: Charted Attention for Neural PDE Operators

cs.AI · 2026-05-09 · unverdicted · novelty 7.0

CATO learns a continuous latent chart for efficient axial attention on PDE meshes and adds derivative-aware supervision to improve accuracy and reduce oversmoothing on general geometries.

Physics-Informed Neural PDE Solvers via Spatio-Temporal MeanFlow

cs.LG · 2026-05-09 · unverdicted · novelty 7.0

Spatio-Temporal MeanFlow adapts MeanFlow to PDEs by replacing the generative velocity field with the physical operator and extending the integral constraint to the spatio-temporal domain, yielding a unified solver for time-dependent and stationary equations with improved accuracy and generalization.

Geometry-Aware Neural Optimizer for Shape Optimization and Inversion

cs.LG · 2026-05-06 · conditional · novelty 7.0 · 3 refs

GANO is an end-to-end differentiable latent-space optimizer that unifies shape encoding, surrogate prediction, and controllable geometry updates for PDE-governed shape optimization and inversion.

AI models of unstable flow exhibit hallucination

physics.flu-dyn · 2026-04-22 · unverdicted · novelty 7.0

AI models of viscous fingering exhibit hallucinations from spectral bias; DeepFingers combines FNO and DeepONet with time-contrast conditioning to predict accurate finger dynamics while preserving mixing metrics.

DeepRitzSplit Neural Operator for Phase-Field Models via Energy Splitting

math.AP · 2026-04-20 · unverdicted · novelty 7.0

A DeepRitzSplit neural operator trained on energy-split variational forms enforces dissipation in phase-field models and outperforms data-driven training in generalization while running faster than Fourier spectral methods on Allen-Cahn and dendritic growth cases.

DiLO: Decoupling Generative Priors and Neural Operators via Diffusion Latent Optimization for Inverse Problems

math.NA · 2026-04-13 · unverdicted · novelty 7.0

DiLO turns diffusion sampling into deterministic latent optimization to satisfy the manifold consistency requirement for neural operators in inverse problem solving.

Is Flow Matching Just Trajectory Replay for Sequential Data?

stat.ML · 2026-02-09 · unverdicted · novelty 7.0

Flow matching on time series targets a closed-form nonparametric velocity field that is a similarity-weighted mixture of observed transition velocities, making neural models approximations to an ideal memory-augmented dynamical system sampler.

CompNO: A Novel Foundation Model approach for solving Partial Differential Equations

cs.LG · 2026-01-12 · unverdicted · novelty 7.0

CompNO composes specialized Fourier neural operator blocks for fundamental differential operators into task-specific solvers that achieve lower L2 error than baselines on linear parametric PDEs and remain competitive on nonlinear flows while exactly satisfying boundaries.

Universal Approximation of Operators with Transformers and Neural Integral Operators

cs.LG · 2024-09-01 · unverdicted · novelty 7.0

Transformers and generalized neural integral operators are shown to universally approximate operators between Hölder and Banach spaces.

Neuromorphic Energy-Aware Learning for Adaptive Deep Brain Stimulation

cs.NE · 2026-06-26 · unverdicted · novelty 6.0

Energy-aware RL with a spiking Q-network in a brain circuit model cuts alpha-beta oscillations 45% and stimulation charge 80% vs continuous DBS, then deploys at 0.52 mW on neuromorphic hardware.

Kolmogorov Regression for Robust Diffusion Policies

cs.LG · 2026-06-16 · unverdicted · novelty 6.0

Kolmogorov regression lifts diffusion policies to Cameron-Martin space via PDEs and a precision-weighted loss, yielding convergence guarantees and empirical gains on PushT and manufacturing benchmarks.

WINO: A Weak-Form Physics Informed Neural Operator for Hyperelasticity on Variable Domains

math.NA · 2026-05-23 · unverdicted · novelty 6.0

WINO is a weak-form physics-informed neural operator for hyperelasticity on variable domains that uses phi-FEM for geometric flexibility and achieves accuracy below 0.04 while cutting computation time by 50-80% as warm starts for solvers.

Symplectic Neural Operators for Learning Infinite Dimensional Hamiltonian Systems

math.DS · 2026-05-15 · unverdicted · novelty 6.0

Symplectic Neural Operators preserve symplectic structure for learning infinite-dimensional Hamiltonian PDEs and deliver improved long-term energy stability in theory and experiments.

Compositional Neural Operators for Multi-Dimensional Fluid Dynamics

cs.LG · 2026-05-12 · unverdicted · novelty 6.0

Compositional Neural Operators decompose multi-dimensional fluid PDEs into a library of pretrained elementary physics blocks assembled via an aggregator that minimizes data and physics residuals.

Don't Fix the Basis -- Learn It: Spectral Representation with Adaptive Basis Learning for PDEs

cs.LG · 2026-05-11 · unverdicted · novelty 6.0

ABLE learns a spatially adaptive Parseval frame from data via an ancillary density to replace fixed bases in spectral neural operators for PDEs.

Continuity Laws for Sequential Models

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

S4 models exhibit stable time-continuity unlike sensitive S6 models, with task continuity predicting performance and enabling temporal subsampling for better efficiency.

Hierarchical Multi-Fidelity Learning for Predicting Three-Dimensional Flame Wrinkling and Turbulent Burning Velocity

cs.LG · 2026-05-06 · unverdicted · novelty 6.0

MuFiNNs integrates sparse experimental measurements with structured low-fidelity models via hierarchical construction and nonlinear correction to predict 3D flame wrinkling dynamics and turbulent mass burning velocity across fuels, pressures, and turbulence levels.

Late Fusion Neural Operators for Extrapolation Across Parameter Space in Partial Differential Equations

cs.LG · 2026-04-17 · unverdicted · novelty 6.0

Late Fusion Neural Operators disentangle state and parameter learning to outperform FNO and CAPE-FNO on advection, Burgers, and reaction-diffusion PDEs with 72% average RMSE reduction in and out of domain.

citing papers explorer

Showing 0 of 0 citing papers after filters.

No citing papers match the current filters.

DeepONet: Learning nonlinear operators for identifying differential equations based on the universal approximation theorem of operators

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer