LGFeb 18

SEMixer: Semantics Enhanced MLP-Mixer for Multiscale Mixing and Long-term Time Series Forecasting

Xu Zhang, Qitong Wang, Peng Wang, Wei Wang

Harvard

arXiv:2602.16220v11.41 citationsh-index: 6Has Code

Originality Incremental advance

AI Analysis

This work addresses efficient multiscale modeling for long-term time series forecasting, particularly in domains like wireless networks, but appears incremental as it builds on MLP-Mixer with enhancements.

The authors tackled the challenge of modeling multiscale patterns in long-term time series forecasting by proposing SEMixer, which achieved third place in the 2025 CCF AlOps Challenge on real wireless network data.

Modeling multiscale patterns is crucial for long-term time series forecasting (TSF). However, redundancy and noise in time series, together with semantic gaps between non-adjacent scales, make the efficient alignment and integration of multi-scale temporal dependencies challenging. To address this, we propose SEMixer, a lightweight multiscale model designed for long-term TSF. SEMixer features two key components: a Random Attention Mechanism (RAM) and a Multiscale Progressive Mixing Chain (MPMC). RAM captures diverse time-patch interactions during training and aggregates them via dropout ensemble at inference, enhancing patch-level semantics and enabling MLP-Mixer to better model multi-scale dependencies. MPMC further stacks RAM and MLP-Mixer in a memory-efficient manner, achieving more effective temporal mixing. It addresses semantic gaps across scales and facilitates better multiscale modeling and forecasting performance. We not only validate the effectiveness of SEMixer on 10 public datasets, but also on the \textit{2025 CCF AlOps Challenge} based on 21GB real wireless network data, where SEMixer achieves third place. The code is available at the link https://github.com/Meteor-Stars/SEMixer.

View on arXiv PDF Code

Similar