AO-PHLGSep 22, 2025

FastNet: Improving the physical consistency of machine-learning weather prediction models through loss function design

arXiv:2509.17601v15 citationsh-index: 83
Originality Incremental advance
AI Analysis

This work addresses the challenge of ensuring physical realism in ML weather forecasts for operational meteorology, representing an incremental advancement through loss function modifications.

The study tackled the problem of improving physical consistency in machine learning weather prediction models by designing alternative loss functions, resulting in similar MSE performance while significantly enhancing spectral fidelity and physical realism, with specific improvements in wind speed accuracy and reduced directional bias.

Machine learning weather prediction (MLWP) models have demonstrated remarkable potential in delivering accurate forecasts at significantly reduced computational cost compared to traditional numerical weather prediction (NWP) systems. However, challenges remain in ensuring the physical consistency of MLWP outputs, particularly in deterministic settings. This study presents FastNet, a graph neural network (GNN)-based global prediction model, and investigates the impact of alternative loss function designs on improving the physical realism of its forecasts. We explore three key modifications to the standard mean squared error (MSE) loss: (1) a modified spherical harmonic (MSH) loss that penalises spectral amplitude errors to reduce blurring and enhance small-scale structure retention; (2) inclusion of horizontal gradient terms in the loss to suppress non-physical artefacts; and (3) an alternative wind representation that decouples speed and direction to better capture extreme wind events. Results show that while the MSH and gradient-based losses \textit{alone} may slightly degrade RMSE scores, when trained in combination the model exhibits very similar MSE performance to an MSE-trained model while at the same time significantly improving spectral fidelity and physical consistency. The alternative wind representation further improves wind speed accuracy and reduces directional bias. Collectively, these findings highlight the importance of loss function design as a mechanism for embedding domain knowledge into MLWP models and advancing their operational readiness.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes