LGPRSTFeb 25

Support Tokens, Stability Margins, and a New Foundation for Robust LLMs

arXiv:2602.22271v11 citationsh-index: 6
Originality Incremental advance
AI Analysis

This addresses robustness issues in large language models by providing a theoretical foundation and practical training modification, though it is incremental as it builds on existing transformer architectures.

The paper reinterprets causal self-attention transformers in a probabilistic framework, revealing a barrier constraint that leads to support tokens and stability margins, and proposes a Bayesian approach with a log-barrier penalty for training more robust LLMs without accuracy loss.

Self-attention is usually described as a flexible, content-adaptive way to mix a token with information from its past. We re-interpret causal self-attention transformers, the backbone of modern foundation models, within a probabilistic framework, much like how classical PCA is extended to probabilistic PCA. However, this re-formulation reveals a surprising and deeper structural insight: due to a change-of-variables phenomenon, a barrier constraint emerges on the self-attention parameters. This induces a highly structured geometry on the token space, providing theoretical insights into the dynamics of LLM decoding. This reveals a boundary where attention becomes ill-conditioned, leading to a margin interpretation similar to classical support vector machines. Just like support vectors, this naturally gives rise to the concept of support tokens. Furthermore, we show that LLMs can be interpreted as a stochastic process over the power set of the token space, providing a rigorous probabilistic framework for sequence modeling. We propose a Bayesian framework and derive a MAP estimation objective that requires only a minimal modification to standard LLM training: the addition of a smooth log-barrier penalty to the usual cross-entropy loss. We demonstrate that this provides more robust models without sacrificing out-of-sample accuracy and that it is straightforward to incorporate in practice.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes