LGJan 6, 2025

Sequence Complementor: Complementing Transformers For Time Series Forecasting with Learnable Sequences

arXiv:2501.02735v14 citationsh-index: 10AAAI
Originality Incremental advance
AI Analysis

This work addresses a key bottleneck in time series forecasting for applications requiring accurate predictions, offering an incremental improvement over existing Transformer adaptations.

The paper tackles the problem of improving Transformer performance in time series forecasting by addressing limited sequence representation diversity, proposing Sequence Complementors with a diversification loss to provide complementary information, and demonstrates superiority over state-of-the-art methods in empirical evaluations.

Since its introduction, the transformer has shifted the development trajectory away from traditional models (e.g., RNN, MLP) in time series forecasting, which is attributed to its ability to capture global dependencies within temporal tokens. Follow-up studies have largely involved altering the tokenization and self-attention modules to better adapt Transformers for addressing special challenges like non-stationarity, channel-wise dependency, and variable correlation in time series. However, we found that the expressive capability of sequence representation is a key factor influencing Transformer performance in time forecasting after investigating several representative methods, where there is an almost linear relationship between sequence representation entropy and mean square error, with more diverse representations performing better. In this paper, we propose a novel attention mechanism with Sequence Complementors and prove feasible from an information theory perspective, where these learnable sequences are able to provide complementary information beyond current input to feed attention. We further enhance the Sequence Complementors via a diversification loss that is theoretically covered. The empirical evaluation of both long-term and short-term forecasting has confirmed its superiority over the recent state-of-the-art methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes