LGAIJan 30

Do Transformers Have the Ability for Periodicity Generalization?

Peking U
arXiv:2601.22690v1h-index: 19
Originality Incremental advance
AI Analysis

This addresses a limitation in large language models for AI researchers, but it is incremental as it focuses on a specific generalization gap.

The paper investigates whether Transformers can generalize periodic patterns to out-of-distribution scenarios, finding that while they memorize periodic data during training, they fail to generalize to unseen composite periodicity.

Large language models (LLMs) based on the Transformer have demonstrated strong performance across diverse tasks. However, current models still exhibit substantial limitations in out-of-distribution (OOD) generalization compared with humans. We investigate this gap through periodicity, one of the basic OOD scenarios. Periodicity captures invariance amid variation. Periodicity generalization represents a model's ability to extract periodic patterns from training data and generalize to OOD scenarios. We introduce a unified interpretation of periodicity from the perspective of abstract algebra and reasoning, including both single and composite periodicity, to explain why Transformers struggle to generalize periodicity. Then we construct Coper about composite periodicity, a controllable generative benchmark with two OOD settings, Hollow and Extrapolation. Experiments reveal that periodicity generalization in Transformers is limited, where models can memorize periodic data during training, but cannot generalize to unseen composite periodicity. We release the source code to support future research.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes