AO-PHLGMar 30, 2024

Aardvark weather: end-to-end data-driven weather forecasting

arXiv:2404.00411v32 citationsh-index: 19
Originality Highly original
AI Analysis

This addresses the problem of high computational costs and limited accuracy in weather forecasting for applications like transportation and safety, representing a significant advancement rather than an incremental improvement.

The paper tackles weather forecasting by replacing the entire operational numerical weather prediction pipeline with an end-to-end data-driven machine learning model, achieving global forecasts that outperform a baseline and local forecasts with comparable or lower errors using only 8% of input data and three orders of magnitude less compute.

Weather forecasting is critical for a range of human activities including transportation, agriculture, industry, as well as the safety of the general public. Machine learning models have the potential to transform the complex weather prediction pipeline, but current approaches still rely on numerical weather prediction (NWP) systems, limiting forecast speed and accuracy. Here we demonstrate that a machine learning model can replace the entire operational NWP pipeline. Aardvark Weather, an end-to-end data-driven weather prediction system, ingests raw observations and outputs global gridded forecasts and local station forecasts. Further, it can be optimised end-to-end to maximise performance over quantities of interest. Global forecasts outperform an operational NWP baseline for multiple variables and lead times. Local station forecasts are skillful up to ten days lead time and achieve comparable and often lower errors than a post-processed global NWP baseline and a state-of-the-art end-to-end forecasting system with input from human forecasters. These forecasts are produced with a remarkably simple neural process model using just 8% of the input data and three orders of magnitude less compute than existing NWP and hybrid AI-NWP methods. We anticipate that Aardvark Weather will be the starting point for a new generation of end-to-end machine learning models for medium-range forecasting that will reduce computational costs by orders of magnitude and enable the rapid and cheap creation of bespoke models for users in a variety of fields, including for the developing world where state-of-the-art local models are not currently available.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes