Cast3 translates NWP principles into a data-driven model using cubed-sphere grids, super-ensembles, and generative nudging to achieve state-of-the-art ensemble predictions that outperform baselines.
WeatherBench 2: A Benchmark for the Next Generation of Data-Driven Global Weather Models.Journal of Advances in Modeling Earth Systems, 16(6)
9 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 9roles
background 3polarities
background 3representative citing papers
ML climate emulators degrade under seasonal distribution shifts that proxy long-term climate change, but physically motivated compositional decompositions improve out-of-distribution performance with modest in-distribution trade-offs.
SwAIther-Precip uses lead-time-conditioned U-Net bias correction followed by diffusion-based super-resolution to downscale AIFS forecasts, achieving 48% CRPS reduction and ~4 km effective resolution up to 5 days lead time.
Extreme Weather Bench supplies standardized case studies, observational data, impact metrics, and code to evaluate weather models on high-impact hazards.
Probabilistic bias correction doubles AI subseasonal forecast skill and wins a 2025 international competition by correcting biases in ECMWF models for pressure, temperature, and precipitation.
HealDA supplies ML-based initial conditions for AI weather models that produce forecasts trailing ERA5-initialized runs by less than one day of effective lead time, with the skill gap arising mainly from initial error size.
An open-source tool is developed for mechanistic interpretability of AI weather models, demonstrated on GraphCast by identifying latent directions corresponding to interpretable weather features.
A PMP-based evaluation framework for testing deep-learning Earth system models on climate-relevant diagnostics beyond short-range forecasts.
citing papers explorer
-
Cast3: Translating numerical weather prediction principles into data-driven forecasting
Cast3 translates NWP principles into a data-driven model using cubed-sphere grids, super-ensembles, and generative nudging to achieve state-of-the-art ensemble predictions that outperform baselines.
-
No Epoch Like the Present: Robust Climate Emulation Requires Out-of-Distribution Generalisation
ML climate emulators degrade under seasonal distribution shifts that proxy long-term climate change, but physically motivated compositional decompositions improve out-of-distribution performance with modest in-distribution trade-offs.
-
SwAIther-Precip: Lead-Time-Aware Bias Correction Enables Kilometer-Scale Downscaling of Global AI Precipitation Forecasts over Switzerland
SwAIther-Precip uses lead-time-conditioned U-Net bias correction followed by diffusion-based super-resolution to downscale AIFS forecasts, achieving 48% CRPS reduction and ~4 km effective resolution up to 5 days lead time.
-
Extreme Weather Bench: A framework and benchmark for evaluation of high-impact weather
Extreme Weather Bench supplies standardized case studies, observational data, impact metrics, and code to evaluate weather models on high-impact hazards.
-
Enhancing AI and Dynamical Subseasonal Forecasts with Probabilistic Bias Correction
Probabilistic bias correction doubles AI subseasonal forecast skill and wins a 2025 international competition by correcting biases in ECMWF models for pressure, temperature, and precipitation.
-
HealDA: Highlighting the importance of initial errors in end-to-end AI weather forecasts
HealDA supplies ML-based initial conditions for AI weather models that produce forecasts trailing ERA5-initialized runs by less than one day of effective lead time, with the skill gap arising mainly from initial error size.
-
Mechanistic Interpretability Tool for AI Weather Models
An open-source tool is developed for mechanistic interpretability of AI weather models, demonstrated on GraphCast by identifying latent directions corresponding to interpretable weather features.
-
A PMP-inspired Evaluation Framework for Assessing Deep-Learning Earth System Models
A PMP-based evaluation framework for testing deep-learning Earth system models on climate-relevant diagnostics beyond short-range forecasts.
- U-Cast: A Surprisingly Simple and Efficient Frontier Probabilistic AI Weather Forecaster