LGAIAug 28, 2024

Mamba or Transformer for Time Series Forecasting? Mixture of Universals (MoU) Is All You Need

arXiv:2408.15997v12 citationsh-index: 23Has Code
Originality Incremental advance
AI Analysis

This work addresses a key limitation in time series forecasting models for applications requiring both efficiency and accuracy, though it appears incremental as it builds on existing architectures like Mamba and Transformers.

The paper tackles the challenge of balancing short-term and long-term dependencies in time series forecasting by introducing Mixture of Universals (MoU), which achieves state-of-the-art performance on seven real-world datasets while maintaining low computational costs.

Time series forecasting requires balancing short-term and long-term dependencies for accurate predictions. Existing methods mainly focus on long-term dependency modeling, neglecting the complexities of short-term dynamics, which may hinder performance. Transformers are superior in modeling long-term dependencies but are criticized for their quadratic computational cost. Mamba provides a near-linear alternative but is reported less effective in time series longterm forecasting due to potential information loss. Current architectures fall short in offering both high efficiency and strong performance for long-term dependency modeling. To address these challenges, we introduce Mixture of Universals (MoU), a versatile model to capture both short-term and long-term dependencies for enhancing performance in time series forecasting. MoU is composed of two novel designs: Mixture of Feature Extractors (MoF), an adaptive method designed to improve time series patch representations for short-term dependency, and Mixture of Architectures (MoA), which hierarchically integrates Mamba, FeedForward, Convolution, and Self-Attention architectures in a specialized order to model long-term dependency from a hybrid perspective. The proposed approach achieves state-of-the-art performance while maintaining relatively low computational costs. Extensive experiments on seven real-world datasets demonstrate the superiority of MoU. Code is available at https://github.com/lunaaa95/mou/.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes