LG AIMay 6

Superposition Is Not Necessary: A Mechanistic Interpretability Analysis of Transformer Representations for Time Series Forecasting

arXiv:2605.051517.5

Predicted impact top 72% in LG · last 90 daysOriginality Incremental advance

AI Analysis

For the time series forecasting community, this work provides a mechanistic explanation for the competitiveness of linear models, suggesting that standard benchmarks do not require the rich compositional representations that drive transformer success in NLP.

The authors investigate whether transformer representations in time series forecasting rely on superposition, as seen in NLP. They find no evidence of superposition in PatchTST's FFN activations; representations remain sparse and stable, with minimal impact from causal interventions, explaining why simple linear models like DLinear remain competitive.

Transformer architectures have been widely adopted for time series forecasting, yet whether the representational mechanisms that make them powerful in NLP actually engage on time series data remains unexplored. The persistent competitiveness of simple linear models such as DLinear has fueled ongoing debate, but no mechanistic explanation for this phenomenon has been offered. We address this gap by applying sparse autoencoders (SAEs), a tool from mechanistic interpretability, to probe the internal representations of PatchTST. We first establish that a single-layer, narrow-dimensional transformer matches the forecasting performance of deeper configurations across commonly used benchmarks. We then train SAEs on the post-GELU intermediate FFN activations with dictionary sizes ranging from 0.5x to 4.0x the native dimensionality. Expanding the dictionary yields negligible downstream performance change (average 0.214%), with large portions of overcomplete dictionaries remaining inactive. Targeted causal interventions on dominant latent features produce minimal forecast perturbation. Across all evaluated settings, we observe no empirical evidence that the analyzed FFN representations rely on strong superposition. Instead, the representations remain sparse, stable under aggressive dictionary expansion, and largely insensitive to latent interventions. These results demonstrate that superposition is not necessary for competitive performance on standard forecasting benchmarks, suggesting they may not demand the rich compositional representations that drive transformer success in language modeling, and helping explain the persistent competitiveness of simple linear models

View on arXiv PDF

Similar