LGAO-PHOct 24, 2024

Recommendations for Comprehensive and Independent Evaluation of Machine Learning-Based Earth System Models

arXiv:2410.19882v212 citationsh-index: 114
Originality Synthesis-oriented
AI Analysis

This work tackles the problem of ensuring reliable and widely adoptable ML-based ESMs for Earth science researchers and policymakers, but it is incremental as it focuses on evaluation guidelines rather than new modeling breakthroughs.

The paper addresses the challenge of establishing credibility for machine learning-based Earth system models (ESMs) by proposing five recommendations for comprehensive, standardized, and independent evaluation, as these models lack explicit physical principles and must represent future states without historical observations.

Machine learning (ML) is a revolutionary technology with demonstrable applications across multiple disciplines. Within the Earth science community, ML has been most visible for weather forecasting, producing forecasts that rival modern physics-based models. Given the importance of deepening our understanding and improving predictions of the Earth system on all time scales, efforts are now underway to develop forecasting models into Earth-system models (ESMs), capable of representing all components of the coupled Earth system (or their aggregated behavior) and their response to external changes. Modeling the Earth system is a much more difficult problem than weather forecasting, not least because the model must represent the alternate (e.g., future) coupled states of the system for which there are no historical observations. Given that the physical principles that enable predictions about the response of the Earth system are often not explicitly coded in these ML-based models, demonstrating the credibility of ML-based ESMs thus requires us to build evidence of their consistency with the physical system. To this end, this paper puts forward five recommendations to enhance comprehensive, standardized, and independent evaluation of ML-based ESMs to strengthen their credibility and promote their wider use.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes