LGCLDec 11, 2023

Sparse Transformer with Local and Seasonal Adaptation for Multivariate Time Series Forecasting

arXiv:2312.06874v215 citationsh-index: 19Has CodeSci Rep
Originality Incremental advance
AI Analysis

This work addresses efficiency and accuracy challenges in multivariate time series forecasting for applications like finance or weather prediction, but it is incremental as it builds on existing Transformer architectures.

The paper tackled the limitations of canonical attention in Transformers for multivariate time series forecasting by proposing a Dozer Attention mechanism with sparse components, achieving superior performance on nine benchmark datasets while improving efficiency without compromising accuracy.

Transformers have achieved remarkable performance in multivariate time series(MTS) forecasting due to their capability to capture long-term dependencies. However, the canonical attention mechanism has two key limitations: (1) its quadratic time complexity limits the sequence length, and (2) it generates future values from the entire historical sequence. To address this, we propose a Dozer Attention mechanism consisting of three sparse components: (1) Local, each query exclusively attends to keys within a localized window of neighboring time steps. (2) Stride, enables each query to attend to keys at predefined intervals. (3) Vary, allows queries to selectively attend to keys from a subset of the historical sequence. Notably, the size of this subset dynamically expands as forecasting horizons extend. Those three components are designed to capture essential attributes of MTS data, including locality, seasonality, and global temporal dependencies. Additionally, we present the Dozerformer Framework, incorporating the Dozer Attention mechanism for the MTS forecasting task. We evaluated the proposed Dozerformer framework with recent state-of-the-art methods on nine benchmark datasets and confirmed its superior performance. The experimental results indicate that excluding a subset of historical time steps from the time series forecasting process does not compromise accuracy while significantly improving efficiency. Code is available at https://github.com/GRYGY1215/Dozerformer.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes