CLMay 19, 2025

Positional Fragility in LLMs: How Offset Effects Reshape Our Understanding of Memorization Risks

arXiv:2505.13171v22 citationsh-index: 16
Originality Incremental advance
AI Analysis

This addresses memorization risks for copyright concerns in AI by revealing a previously overlooked positional fragility, though it is incremental in refining evaluation methods.

The study tackled the problem of verbatim memorization in large language models by identifying the offset effect, where memorization decreases with longer prefixes and sharp declines when prefixes are offset from the start of the context window, using models trained on 83B tokens to show that shifting data deeper suppresses memorization and degeneration.

Large language models are known to memorize parts of their training data, posing risk of copyright violations. To systematically examine this risk, we pretrain language models (1B/3B/8B) from scratch on 83B tokens, mixing web-scale data with public domain books used to simulate copyrighted content at controlled frequencies at lengths at least ten times longer than prior work. We thereby identified the offset effect, a phenomenon characterized by two key findings: (1) verbatim memorization is most strongly triggered by short prefixes drawn from the beginning of the context window, with memorization decreasing counterintuitively as prefix length increases; and (2) a sharp decline in verbatim recall when prefix begins offset from the initial tokens of the context window. We attribute this to positional fragility: models rely disproportionately on the earliest tokens in their context window as retrieval anchors, making them sensitive to even slight shifts. We further observe that when the model fails to retrieve memorized content, it often produces degenerated text. Leveraging these findings, we show that shifting sensitive data deeper into the context window suppresses both extractable memorization and degeneration. Our results suggest that positional offset is a critical and previously overlooked axis for evaluating memorization risks, since prior work implicitly assumed uniformity by probing only from the beginning of training sequences.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes