LGMLOct 14, 2024

GIFT-Eval: A Benchmark For General Time Series Forecasting Model Evaluation

arXiv:2410.10393v2117 citationsh-index: 35Has Code
Originality Synthesis-oriented
AI Analysis

This provides a standardized evaluation framework for researchers and practitioners in time series forecasting, though it is incremental as it builds on existing benchmarking efforts.

The authors tackled the lack of comprehensive benchmarks for time series foundation models by introducing GIFT-Eval, a benchmark with 23 datasets, 144,000 time series, and 177 million data points, which includes analysis of 17 baselines and a pretraining dataset of 230 billion data points.

Time series foundation models excel in zero-shot forecasting, handling diverse tasks without explicit training. However, the advancement of these models has been hindered by the lack of comprehensive benchmarks. To address this gap, we introduce the General Time Series Forecasting Model Evaluation, GIFT-Eval, a pioneering benchmark aimed at promoting evaluation across diverse datasets. GIFT-Eval encompasses 23 datasets over 144,000 time series and 177 million data points, spanning seven domains, 10 frequencies, multivariate inputs, and prediction lengths ranging from short to long-term forecasts. To facilitate the effective pretraining and evaluation of foundation models, we also provide a non-leaking pretraining dataset containing approximately 230 billion data points. Additionally, we provide a comprehensive analysis of 17 baselines, which includes statistical models, deep learning models, and foundation models. We discuss each model in the context of various benchmark characteristics and offer a qualitative analysis that spans both deep learning and foundation models. We believe the insights from this analysis, along with access to this new standard zero-shot time series forecasting benchmark, will guide future developments in time series foundation models. Code, data, and the leaderboard can be found at https://github.com/SalesforceAIResearch/gift-eval .

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes