LGAIAO-PHGEO-PHSep 10, 2025

MoWE : A Mixture of Weather Experts

arXiv:2509.09052v11 citationsh-index: 27
Originality Highly original
AI Analysis

This provides a computationally efficient strategy to improve weather forecasting accuracy, though it is incremental as it builds on existing models rather than creating new ones.

The paper tackled the plateau in data-driven weather models by introducing a Mixture of Experts (MoWE) approach that optimally combines existing models, achieving up to a 10% lower RMSE than the best AI weather model on a 2-day forecast horizon.

Data-driven weather models have recently achieved state-of-the-art performance, yet progress has plateaued in recent years. This paper introduces a Mixture of Experts (MoWE) approach as a novel paradigm to overcome these limitations, not by creating a new forecaster, but by optimally combining the outputs of existing models. The MoWE model is trained with significantly lower computational resources than the individual experts. Our model employs a Vision Transformer-based gating network that dynamically learns to weight the contributions of multiple "expert" models at each grid point, conditioned on forecast lead time. This approach creates a synthesized deterministic forecast that is more accurate than any individual component in terms of Root Mean Squared Error (RMSE). Our results demonstrate the effectiveness of this method, achieving up to a 10% lower RMSE than the best-performing AI weather model on a 2-day forecast horizon, significantly outperforming individual experts as well as a simple average across experts. This work presents a computationally efficient and scalable strategy to push the state of the art in data-driven weather prediction by making the most out of leading high-quality forecast models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes