AO-PHLGApr 4, 2024

Site-specific Deterministic Temperature and Humidity Forecasts with Explainable and Reliable Machine Learning

arXiv:2404.03310v12 citationsh-index: 4Appl Sci
Originality Incremental advance
AI Analysis

This work addresses the need for accurate local weather predictions for energy demand forecasting, but it is incremental as it applies an existing machine learning method to a known bottleneck in weather modeling.

The study tackled the problem of site-specific weather forecasts for energy operators by using gradient boosting decision trees (XGBoost) to predict temperature and dew point at multiple sites in Australia, showing significant improvement over bias-corrected numerical weather prediction models and comparable results to non-machine learning methods.

Site-specific weather forecasts are essential to accurate prediction of power demand and are consequently of great interest to energy operators. However, weather forecasts from current numerical weather prediction (NWP) models lack the fine-scale detail to capture all important characteristics of localised real-world sites. Instead they provide weather information representing a rectangular gridbox (usually kilometres in size). Even after post-processing and bias correction, area-averaged information is usually not optimal for specific sites. Prior work on site optimised forecasts has focused on linear methods, weighted consensus averaging, time-series methods, and others. Recent developments in machine learning (ML) have prompted increasing interest in applying ML as a novel approach towards this problem. In this study, we investigate the feasibility of optimising forecasts at sites by adopting the popular machine learning model gradient boosting decision tree, supported by the Python version of the XGBoost package. Regression trees have been trained with historical NWP and site observations as training data, aimed at predicting temperature and dew point at multiple site locations across Australia. We developed a working ML framework, named 'Multi-SiteBoost' and initial testing results show a significant improvement compared with gridded values from bias-corrected NWP models. The improvement from XGBoost is found to be comparable with non-ML methods reported in literature. With the insights provided by SHapley Additive exPlanations (SHAP), this study also tests various approaches to understand the ML predictions and increase the reliability of the forecasts generated by ML.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes