LGApr 7, 2024

TimeGPT in Load Forecasting: A Large Time Series Model Perspective

Wenlong Liao, Fernando Porte-Agel, Jiannong Fang, Christian Rehtanz, Shouxiang Wang, Dechang Yang, Zhe Yang

arXiv:2404.04885v212.566 citationsh-index: 67Applied Energy

Originality Incremental advance

AI Analysis

This addresses load forecasting challenges for energy systems where data is limited, but it is incremental as it adapts existing large model concepts to time series.

The paper tackles load forecasting with scarce historical data by fine-tuning a large pre-trained time series model (TimeGPT) on massive diverse datasets, showing it outperforms benchmarks on several real datasets, especially for short look-ahead times, though performance can vary based on data distribution differences.

Machine learning models have made significant progress in load forecasting, but their forecast accuracy is limited in cases where historical load data is scarce. Inspired by the outstanding performance of large language models (LLMs) in computer vision and natural language processing, this paper aims to discuss the potential of large time series models in load forecasting with scarce historical data. Specifically, the large time series model is constructed as a time series generative pre-trained transformer (TimeGPT), which is trained on massive and diverse time series datasets consisting of 100 billion data points (e.g., finance, transportation, banking, web traffic, weather, energy, healthcare, etc.). Then, the scarce historical load data is used to fine-tune the TimeGPT, which helps it to adapt to the data distribution and characteristics associated with load forecasting. Simulation results show that TimeGPT outperforms the benchmarks (e.g., popular machine learning models and statistical models) for load forecasting on several real datasets with scarce training samples, particularly for short look-ahead times. However, it cannot be guaranteed that TimeGPT is always superior to benchmarks for load forecasting with scarce data, since the performance of TimeGPT may be affected by the distribution differences between the load data and the training data. In practical applications, we can divide the historical data into a training set and a validation set, and then use the validation set loss to decide whether TimeGPT is the best choice for a specific dataset.

View on arXiv PDF

Similar