LGJun 24, 2025

Scaling Transformers for Time Series Forecasting: Do Pretrained Large Models Outperform Small-Scale Alternatives?

arXiv:2507.02907v12 citationsh-index: 6Artif Intell Rev
Originality Synthesis-oriented
AI Analysis

It addresses the effectiveness of large pre-trained models for time series forecasting, which is an incremental study comparing existing methods.

This paper investigates whether pre-trained large-scale time series models outperform small-scale transformers in forecasting tasks, finding that pre-trained models have strengths but simpler models remain competitive in certain scenarios.

Large pre-trained models have demonstrated remarkable capabilities across domains, but their effectiveness in time series forecasting remains understudied. This work empirically examines whether pre-trained large-scale time series models (LSTSMs) trained on diverse datasets can outperform traditional non-pretrained small-scale transformers in forecasting tasks. We analyze state-of-the-art (SOTA) pre-trained universal time series models (e.g., Moirai, TimeGPT) alongside conventional transformers, evaluating accuracy, computational efficiency, and interpretability across multiple benchmarks. Our findings reveal the strengths and limitations of pre-trained LSTSMs, providing insights into their suitability for time series tasks compared to task-specific small-scale architectures. The results highlight scenarios where pretraining offers advantages and where simpler models remain competitive.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes