Bridging the climate to energy data gap: simulated annealing for representative climate year selection

arXiv:2605.159582.3
Predicted impact top 98% in AO-PH · last 90 daysOriginality Incremental advance
AI Analysis

For energy system modelers, it provides a validated method to select representative climate years, reducing bias in investment decisions and capturing plausible weather conditions.

The paper addresses the mismatch between the hundreds of climate years used in climate science and the few years that energy system models can process. It proposes simulated annealing for selecting representative climate years, achieving subsets 2.5–3.5 times more representative than current ENTSO-E practice and an effective sample size 4–5 times the subset size.

Energy system models are increasingly dependent on representative climate input. Yet, a fundamental mismatch persists between the hundreds of simulated years often used in climate science and the handful of years that computationally demanding power system models can process. Current practice, including ENTSO-E's European Resource Adequacy Assessment, relies on climate year selections that have not been validated against explicit representativeness criteria. This risks biased investment decisions and blind spots for plausible weather conditions. This study proposes simulated annealing as an optimisation method for selecting representative subsets of complete climate years from large climate ensembles. Representativeness is quantified using the seasonal sliced Wasserstein distance, a metric from optimal transport theory that captures representativeness on marginal distributions, inter-variable correlations, and seasonal structure simultaneously. We evaluate simulated annealing against the alternative methods random search, filtered random search, and K-Medoids clustering across three test cases spanning the Netherlands and Europe, using 180 climate years from the Pan-European Climate Database as a reference. Simulated annealing consistently produces the most representative subsets and outperforms all compared methods. Simulated annealing achieves an effective sample size four to five times the actual subset size. The resulting subsets are roughly 2.5--3.5 times more representative than current ENTSO-E practice. The method is application-agnostic and its output can serve as a validated climate data input to any subsequent (energy) impact study.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes