AILGNANAApr 14

Numerical Instability and Chaos: Quantifying the Unpredictability of Large Language Models

arXiv:2604.1320630.4h-index: 6
AI Analysis

For developers and users of LLMs in agentic workflows, this work provides a mechanistic understanding of reliability issues stemming from numerical precision.

This paper identifies numerical instability due to finite floating-point precision as a root cause of unpredictability in LLMs, revealing a chaotic 'avalanche effect' in early layers and three distinct regimes of behavior (stable, chaotic, signal-dominated).

As Large Language Models (LLMs) are increasingly integrated into agentic workflows, their unpredictability stemming from numerical instability has emerged as a critical reliability issue. While recent studies have demonstrated the significant downstream effects of these instabilities, the root causes and underlying mechanisms remain poorly understood. In this paper, we present a rigorous analysis of how unpredictability is rooted in the finite numerical precision of floating-point representations, tracking how rounding errors propagate, amplify, or dissipate through Transformer computation layers. Specifically, we identify a chaotic "avalanche effect" in the early layers, where minor perturbations trigger binary outcomes: either rapid amplification or complete attenuation. Beyond specific error instances, we demonstrate that LLMs exhibit universal, scale-dependent chaotic behaviors characterized by three distinct regimes: 1) a stable regime, where perturbations fall below an input-dependent threshold and vanish, resulting in constant outputs; 2) a chaotic regime, where rounding errors dominate and drive output divergence; and 3) a signal-dominated regime, where true input variations override numerical noise. We validate these findings extensively across multiple datasets and model architectures.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes