CLLGApr 14

Better and Worse with Scale: How Contextual Entrainment Diverges with Model Size

DeepMindU of Toronto
arXiv:2604.1327583.6h-index: 40
Predicted impact top 53% in CL · last 90 daysOriginality Incremental advance
AI Analysis

Reveals that scaling alone does not uniformly improve context sensitivity but instead reshapes it, with opposing trends for semantic and non-semantic contexts, which is important for understanding and improving LLM reliability.

Larger language models become better at ignoring false claims but worse at ignoring irrelevant tokens, with entrainment following opposite scaling trends for semantic vs. non-semantic contexts. The largest models are 4x more resistant to counterfactual misinformation but 2x more prone to copying arbitrary tokens.

Larger language models become simultaneously better and worse at handling contextual information -- better at ignoring false claims, worse at ignoring irrelevant tokens. We formalize this apparent paradox through the first scaling laws for contextual entrainment, the tendency of models to favor tokens that appeared in context regardless of relevance. Analyzing the Cerebras-GPT (111M-13B) and Pythia (410M-12B) model families, we find entrainment follows predictable power-law scaling, but with opposite trends depending on context type: semantic contexts show decreasing entrainment with scale, while non-semantic contexts show increasing entrainment. Concretely, the largest models are four times more resistant to counterfactual misinformation than the smallest, yet simultaneously twice as prone to copying arbitrary tokens. These diverging trends, which replicate across model families, suggest that semantic filtering and mechanical copying are functionally distinct behaviors that scale in opposition -- scaling alone does not resolve context sensitivity, it reshapes it.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes