AO-PHLGJun 13, 2025

Arnoldi Singular Vector perturbations for machine learning weather prediction

arXiv:2506.22450v1
Originality Incremental advance
AI Analysis

This addresses uncertainty estimation for weather forecasting, which is crucial for reliable decision-making, but the method is incremental as it adapts existing singular vector techniques to ML models without linear or adjoint versions.

The paper tackles the problem of quantifying uncertainty in machine learning weather prediction by developing the Arnoldi-SV method to generate initial condition perturbations that capture error growth, showing it finds dynamically meaningful patterns for the 24h Pangu Weather model that grow from the start of forecasts.

Since weather forecasts are fundamentally uncertain, reliable decision making requires information on the likelihoods of future weather scenarios. We explore the sensitivity of machine learning weather prediction (MLWP) using the 24h Pangu Weather ML model of Huawei to errors in the initial conditions with a specific kind of Singular Vector (SV) perturbations. Our Arnoldi-SV (A-SV) method does not need linear nor adjoint model versions and is applicable to numerical weather prediction (NWP) as well as MLWP. It observes error growth within a given optimization time window by iteratively applying a forecast model to perturbed model states. This creates a Krylov subspace, implicitly based on a matrix operator, which approximates the local error growth. Each iteration adds new dimensions to the Krylov space and its leading right SVs are expected to turn into directions of growing errors. We show that A-SV indeed finds dynamically meaningful perturbation patterns for the 24h Pangu Weather model, which grow right from the beginning of the forecast rollout. These perturbations describe local unstable modes and could be a basis to initialize MLWP ensembles. Since we start A-SV from random noise perturbations, the algorithm transforms noise into perturbations conditioned on a given reference state - a process that is akin to the denoising process of the generic diffusion based ML model of GenCast, therefor we briefly discuss similarities and differences.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes