Therm-FM adapts a pretrained PDE foundation model using thermal-equivalent multi-fidelity training to achieve up to 10.6x lower error in 3D-IC thermal simulation with under 20% of typical training data and strong cross-design transfer.
hub Canonical reference
DeepONet: Learning nonlinear operators for identifying differential equations based on the universal approximation theorem of operators
Canonical reference. 77% of citing Pith papers cite this work as background.
abstract
While it is widely known that neural networks are universal approximators of continuous functions, a less known and perhaps more powerful result is that a neural network with a single hidden layer can approximate accurately any nonlinear continuous operator. This universal approximation theorem is suggestive of the potential application of neural networks in learning nonlinear operators from data. However, the theorem guarantees only a small approximation error for a sufficient large network, and does not consider the important optimization and generalization errors. To realize this theorem in practice, we propose deep operator networks (DeepONets) to learn operators accurately and efficiently from a relatively small dataset. A DeepONet consists of two sub-networks, one for encoding the input function at a fixed number of sensors $x_i, i=1,\dots,m$ (branch net), and another for encoding the locations for the output functions (trunk net). We perform systematic simulations for identifying two types of operators, i.e., dynamic systems and partial differential equations, and demonstrate that DeepONet significantly reduces the generalization error compared to the fully-connected networks. We also derive theoretically the dependence of the approximation error in terms of the number of sensors (where the input function is defined) as well as the input function type, and we verify the theorem with computational results. More importantly, we observe high-order error convergence in our computational tests, namely polynomial rates (from half order to fourth order) and even exponential convergence with respect to the training dataset size.
hub tools
citation-role summary
citation-polarity summary
representative citing papers
Proves first UATs for k-times differentiable nonlinear operators and their derivatives via OL architectures uniformly on compact sets in weighted Bastiani-Sobolev spaces on general Banach spaces.
Constraint-Aware Flow Matching integrates constraint projections into the flow matching training objective to align model dynamics with constrained sampling and reduce distributional shift.
Any maximally monotone operator can be approximated in local graph convergence by continuous encoder-decoder networks, with structure-preserving versions that retain maximal monotonicity via resolvent parameterizations.
A single-network implicit neural optimal transport method that solves the c-transform via proximal fixed-point iteration for stable, non-adversarial training.
A latent Structured Spectral Propagator enables stable autoregressive PDE forecasting by decoupling spatial details from recurrent modal dynamics.
CATO learns a continuous latent chart for efficient axial attention on PDE meshes and adds derivative-aware supervision to improve accuracy and reduce oversmoothing on general geometries.
Spatio-Temporal MeanFlow adapts MeanFlow to PDEs by replacing the generative velocity field with the physical operator and extending the integral constraint to the spatio-temporal domain, yielding a unified solver for time-dependent and stationary equations with improved accuracy and generalization.
GANO is an end-to-end differentiable latent-space optimizer that unifies shape encoding, surrogate prediction, and controllable geometry updates for PDE-governed shape optimization and inversion.
AI models of viscous fingering exhibit hallucinations from spectral bias; DeepFingers combines FNO and DeepONet with time-contrast conditioning to predict accurate finger dynamics while preserving mixing metrics.
A DeepRitzSplit neural operator trained on energy-split variational forms enforces dissipation in phase-field models and outperforms data-driven training in generalization while running faster than Fourier spectral methods on Allen-Cahn and dendritic growth cases.
DiLO turns diffusion sampling into deterministic latent optimization to satisfy the manifold consistency requirement for neural operators in inverse problem solving.
Flow matching on time series targets a closed-form nonparametric velocity field that is a similarity-weighted mixture of observed transition velocities, making neural models approximations to an ideal memory-augmented dynamical system sampler.
CompNO composes specialized Fourier neural operator blocks for fundamental differential operators into task-specific solvers that achieve lower L2 error than baselines on linear parametric PDEs and remain competitive on nonlinear flows while exactly satisfying boundaries.
Transformers and generalized neural integral operators are shown to universally approximate operators between Hölder and Banach spaces.
Energy-aware RL with a spiking Q-network in a brain circuit model cuts alpha-beta oscillations 45% and stimulation charge 80% vs continuous DBS, then deploys at 0.52 mW on neuromorphic hardware.
Kolmogorov regression lifts diffusion policies to Cameron-Martin space via PDEs and a precision-weighted loss, yielding convergence guarantees and empirical gains on PushT and manufacturing benchmarks.
WINO is a weak-form physics-informed neural operator for hyperelasticity on variable domains that uses phi-FEM for geometric flexibility and achieves accuracy below 0.04 while cutting computation time by 50-80% as warm starts for solvers.
Symplectic Neural Operators preserve symplectic structure for learning infinite-dimensional Hamiltonian PDEs and deliver improved long-term energy stability in theory and experiments.
Compositional Neural Operators decompose multi-dimensional fluid PDEs into a library of pretrained elementary physics blocks assembled via an aggregator that minimizes data and physics residuals.
ABLE learns a spatially adaptive Parseval frame from data via an ancillary density to replace fixed bases in spectral neural operators for PDEs.
S4 models exhibit stable time-continuity unlike sensitive S6 models, with task continuity predicting performance and enabling temporal subsampling for better efficiency.
MuFiNNs integrates sparse experimental measurements with structured low-fidelity models via hierarchical construction and nonlinear correction to predict 3D flame wrinkling dynamics and turbulent mass burning velocity across fuels, pressures, and turbulence levels.
Late Fusion Neural Operators disentangle state and parameter learning to outperform FNO and CAPE-FNO on advection, Burgers, and reaction-diffusion PDEs with 72% average RMSE reduction in and out of domain.
citing papers explorer
No citing papers match the current filters.