AO-PHLGAug 15, 2025

CERA: A Framework for Improved Generalization of Machine Learning Models to Changed Climates

arXiv:2509.00010v13 citationsh-index: 1
Originality Highly original
AI Analysis

This addresses the challenge of robust generalization for climate science applications, offering a data-driven alternative to manual feature engineering, though it is incremental as it builds on existing climate-invariant input methods.

The authors tackled the problem of machine learning models failing to generalize under climate change by introducing CERA, a framework that uses latent-space alignment to improve generalization to warmer climates without labeled data, outperforming baselines in predicting moisture and energy tendencies and capturing precipitation extremes.

Robust generalization under climate change remains a major challenge for machine learning applications in climate science. Most existing approaches struggle to extrapolate beyond the climate they were trained on, leading to a strong dependence on training data from model simulations of warm climates. Use of climate-invariant inputs improves generalization but requires challenging manual feature engineering. Here, we present CERA (Climate-invariant Encoding through Representation Alignment), a machine learning framework consisting of an autoencoder with explicit latent-space alignment, followed by a predictor for downstream process estimation. We test CERA on the problem of parameterizing moist-physics processes. Without training on labeled data from a +4K climate, CERA leverages labeled control-climate data and unlabeled warmer-climate inputs to improve generalization to the warmer climate, outperforming both raw-input and physically informed baselines in predicting key moisture and energy tendencies. It captures not only the vertical and meridional structures of the moisture tendencies, but also shifts in the intensity distribution of precipitation including extremes. Ablation experiments show that latent alignment improves both accuracy and the robustness across random seeds used in training. While some reduced skill remains in the boundary layer, the framework offers a data-driven alternative to manual feature engineering of climate invariant inputs. Beyond parameterizations used in hybrid ML-physics systems, the approach holds promise for other climate applications such as statistical downscaling.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes