LGJun 10, 2025

LaDCast: A Latent Diffusion Model for Medium-Range Ensemble Weather Forecasting

arXiv:2506.09193v16 citationsh-index: 6Has Code
Originality Highly original
AI Analysis

This work addresses the problem of accurate and efficient probabilistic weather forecasting for meteorologists and climate scientists, offering a novel approach that reduces computational costs and improves extreme event prediction.

The paper tackles medium-range ensemble weather forecasting by introducing LaDCast, a latent diffusion model that generates hourly ensemble forecasts in a learned latent space, achieving deterministic and probabilistic skill close to a leading operational model (European Centre for Medium-Range Forecast IFS-ENS) and superior performance in tracking extreme events like cyclones.

Accurate probabilistic weather forecasting demands both high accuracy and efficient uncertainty quantification, challenges that overburden both ensemble numerical weather prediction (NWP) and recent machine-learning methods. We introduce LaDCast, the first global latent-diffusion framework for medium-range ensemble forecasting, which generates hourly ensemble forecasts entirely in a learned latent space. An autoencoder compresses high-dimensional ERA5 reanalysis fields into a compact representation, and a transformer-based diffusion model produces sequential latent updates with arbitrary hour initialization. The model incorporates Geometric Rotary Position Embedding (GeoRoPE) to account for the Earth's spherical geometry, a dual-stream attention mechanism for efficient conditioning, and sinusoidal temporal embeddings to capture seasonal patterns. LaDCast achieves deterministic and probabilistic skill close to that of the European Centre for Medium-Range Forecast IFS-ENS, without any explicit perturbations. Notably, LaDCast demonstrates superior performance in tracking rare extreme events such as cyclones, capturing their trajectories more accurately than established models. By operating in latent space, LaDCast reduces storage and compute by orders of magnitude, demonstrating a practical path toward forecasting at kilometer-scale resolution in real time. We open-source our code and models and provide the training and evaluation pipelines at: https://github.com/tonyzyl/ladcast.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes