LGMLMar 9

Bayesian Transformer for Probabilistic Load Forecasting in Smart Grids

arXiv:2603.07899v11 citations
Predicted impact top 99% in LG · last 90 daysOriginality Incremental advance
AI Analysis

This work provides more reliable probabilistic load forecasts with better uncertainty estimates for grid operators, which is crucial for risk-based decision-making in smart grids, especially under extreme weather conditions. This is an incremental improvement on existing methods.

This study introduces a Bayesian Transformer (BT) framework to improve probabilistic load forecasting in smart grids, addressing the issue of overconfident predictions from existing deep learning models. The BT achieves a CRPS of 0.0289 on the PJM 24-hour benchmark, a 7.4% improvement over Deep Ensembles, and maintains high prediction interval coverage (89.6-90.1% PICP) during extreme weather events.

The reliable operation of modern power grids requires probabilistic load forecasts with well-calibrated uncertainty estimates. However, existing deep learning models produce overconfident point predictions that fail catastrophically under extreme weather distributional shifts. This study proposes a Bayesian Transformer (BT) framework that integrates three complementary uncertainty mechanisms into a PatchTST backbone: Monte Carlo Dropout for epistemic parameter uncertainty, variational feed-forward layers with log-uniform weight priors, and stochastic attention with learnable Gaussian noise perturbations on pre-softmax logits, representing, to the best of our knowledge, the first application of Bayesian attention to probabilistic load forecasting. A seven-level multi-quantile pinball-loss prediction head and post-training isotonic regression calibration produce sharp, near-nominally covered prediction intervals. Evaluation of five grid datasets (PJM, ERCOT, ENTSO-E Germany, France, and Great Britain) augmented with NOAA covariates across 24, 48, and 168-hour horizons demonstrates state-of-the-art performance. On the primary benchmark (PJM, H=24h), BT achieves a CRPS of 0.0289, improving 7.4% over Deep Ensembles and 29.9% over the deterministic LSTM, with 90.4% PICP at the 90% nominal level and the narrowest prediction intervals (4,960 MW) among all probabilistic baselines. During heat-wave and cold snap events, BT maintained 89.6% and 90.1% PICP respectively, versus 64.7% and 67.2% for the deterministic LSTM, confirming that Bayesian epistemic uncertainty naturally widens intervals for out-of-distribution inputs. Calibration remained stable across all horizons (89.8-90.4% PICP), while ablation confirmed that each component contributed a distinct value. The calibrated outputs directly support risk-based reserve sizing, stochastic unit commitment, and demand response activation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes