An ML model trained only on harmonized gridded observations achieves competitive medium-range weather forecast skill with the IFS for several upper-air and surface headline scores when verified against observations.
Canonical reference
Accurate medium-range global weather forecasting with 3D neural networks.Nature, 619(7970):533– 538, July 2023
Canonical reference. 78% of citing Pith papers cite this work as background.
citation-role summary
citation-polarity summary
representative citing papers
STRATA is the first autoregressive transformer emulator for global 4.9-km storm-resolving atmospheric dynamics, achieving 50x better energy efficiency than the underlying physics model while producing realistic km-scale features in 24-hour forecasts.
Forward Flux Sampling applied to a 1-degree neural weather emulator resolves conditional tropical cyclogenesis rates spanning three orders of magnitude across 98 Atlantic initial conditions, with self-consistency ratio 1.03 to direct sampling and computational gains up to 140X.
MotifGen is the first multi-source generative model for spatiotemporal interpolation of misaligned microwave cyclone images from heterogeneous instruments at irregular intervals, achieving lower CRPS via self-supervised training and closer power spectra than deterministic baselines when combining in
Targeted perturbations in the Aurora AI model can steer Hurricane Sandy's trajectory by more than 500 km after seven days via amplification in sensitive regions identified by FTLE and wave activity diagnostics.
AI weather models may simulate the atmosphere via particle positions in latent space whose updates follow gradient flow on a learned free energy functional rather than conventional physical equations.
Cast3 translates NWP principles into a data-driven model using cubed-sphere grids, super-ensembles, and generative nudging to achieve state-of-the-art ensemble predictions that outperform baselines.
Historically trained ML weather emulators quantify fast precipitation changes from CO2 perturbations and produce results that agree with Earth System Models.
Two new global-domain smoothing methods enable spatial verification scores like FSS on high-resolution global precipitation forecasts while handling grid area variability and missing data.
Online conformal prediction post-processing guarantees calibrated uncertainty coverage for GenCast, NeuralGCM, and AIFS-ENS forecasts of temperature and precipitation including extremes.
Tyan-WP is a pretrained wind power foundation model that outperforms site-specific TSMs and generic LTSMs in zero-shot ultra-short-term probabilistic forecasting on U.S. and U.K. sites via static embeddings and PAMF module.
NTK-UQ produces 31-37% sharper 90% prediction intervals than split conformal prediction for extreme weather forecasts, with adaptive scaling via architecture-dependent eigenvalue truncation and ICA decomposition of last-layer features.
Mechanism learning infers active local evolution rules via prototype-anchored descriptors to achieve more robust forecasting than direct state prediction on benchmarks like Burgers, WeatherBench2, and Lorenz96.
Wavelet Flow Matching emulates multi-scale PDE-governed systems by transporting velocities directly in a hierarchical wavelet representation via U-Net, yielding improved long-horizon stability and spectral accuracy on fluid benchmarks.
Q-SRDRN multi-quantile network with pinball loss and per-quantile heads detects extreme precipitation events up to 18 times more effectively than deterministic baselines while preserving augmentation benefits for the median.
ShardTensor is a domain-parallelism system for SciML that enables flexible scaling of extreme-resolution spatial datasets by removing the constraint of batch size one per device.
AxiomOcean deploys a 3D encoder-backbone-decoder architecture that jointly predicts upper-ocean variables and outperforms prior AI models by 20-35% in day-1 RMSE while preserving eddy kinetic energy and vertical consistency.
Extreme Weather Bench supplies standardized case studies, observational data, impact metrics, and code to evaluate weather models on high-impact hazards.
ESFM is a single open foundation model that unifies heterogeneous Earth data sources and forecasts missing regions while preserving inter-variable physical relationships.
A physics-constrained consistency model downscales Greenland SMB and surface temperature by a factor of 32 while preserving coarse-scale sums and outperforming interpolation on test metrics.
Extends Potential CRPS with weights and IDR post-processing to enable fair comparisons of AIWP and NWP models on extreme weather, finding AI models more informative across most variables and thresholds.
Instrumented data augments observations with mechanistic models, uncertainty, and counterfactuals to enable causal interventions via Pearl's do-operator in scientific machine learning.
A MATLAB/ONNX testbed integrates the Pangu AI model with PID closed-loop control to perform single-input single-output perturbation-response experiments on typhoon track and intensity.
An open-source tool is developed for mechanistic interpretability of AI weather models, demonstrated on GraphCast by identifying latent directions corresponding to interpretable weather features.