LGEPAIDCAO-PHMay 7, 2025

ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling

arXiv:2505.04802v28 citationsh-index: 42SC
Originality Highly original
AI Analysis

This addresses the need for robust regional climate predictions for decision-makers, representing a domain-specific advancement with potential broad impact in weather and climate modeling.

The paper tackles the problem of limited generalization and computational inefficiency in AI methods for climate downscaling by introducing ORBIT-2, a scalable foundation model that achieves high accuracy with R² scores of 0.98-0.99 on benchmarks and scales to 10 billion parameters with up to 4.1 exaFLOPS throughput.

Sparse observations and coarse-resolution climate models limit effective regional decision-making, underscoring the need for robust downscaling. However, existing AI methods struggle with generalization across variables and geographies and are constrained by the quadratic complexity of Vision Transformer (ViT) self-attention. We introduce ORBIT-2, a scalable foundation model for global, hyper-resolution climate downscaling. ORBIT-2 incorporates two key innovations: (1) Residual Slim ViT (Reslim), a lightweight architecture with residual learning and Bayesian regularization for efficient, robust prediction; and (2) TILES, a tile-wise sequence scaling algorithm that reduces self-attention complexity from quadratic to linear, enabling long-sequence processing and massive parallelism. ORBIT-2 scales to 10 billion parameters across 65,536 GPUs, achieving up to 4.1 exaFLOPS sustained throughput and 74--98% strong scaling efficiency. It supports downscaling to 0.9 km global resolution and processes sequences up to 4.2 billion tokens. On 7 km resolution benchmarks, ORBIT-2 achieves high accuracy with $R^2$ scores in the range of 0.98--0.99 against observational data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes