Sonny: Breaking the Compute Wall in Medium-Range Weather Forecasting

arXiv:2603.2128459.41 citationsh-index: 2
Predicted impact top 57% in LG · last 90 daysOriginality Incremental advance
AI Analysis

This addresses the compute barrier for academic groups in weather forecasting, though it is incremental as it builds on existing deep learning methods.

The paper tackles the high computational cost of data-driven weather forecasting by introducing Sonny, an efficient hierarchical transformer that achieves competitive medium-range forecasting performance on WeatherBench2, with training feasible on a single NVIDIA A40 GPU in about 5.5 days.

Weather forecasting is a fundamental problem for protecting lives and infrastructure from high-impact atmospheric events. Recently, data-driven weather forecasting methods based on deep learning have demonstrated strong performance, often reaching accuracy levels competitive with operational numerical systems. However, many existing models rely on large-scale training regimes and compute-intensive architectures, which raises the practical barrier for academic groups with limited compute resources. Here we introduce Sonny, an efficient hierarchical transformer that achieves competitive medium-range forecasting performance while remaining feasible within reasonable compute budgets. At the core of Sonny is a two-stage StepsNet design: a narrow slow path first models large-scale atmospheric dynamics, and a subsequent full-width fast path integrates thermodynamic interactions. To stabilize medium-range rollout without an additional fine-tuning stage, we apply exponential moving average (EMA) during training. On WeatherBench2, Sonny yields robust medium-range forecast skill, remains competitive with operational baselines, and demonstrates clear advantages over FastNet, particularly at extended tropical lead times. In practice, Sonny can be trained to convergence on a single NVIDIA A40 GPU in approximately 5.5 days.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes