CLAILGOct 15, 2024

The Fair Language Model Paradox

arXiv:2410.11985v14 citationsh-index: 14
Originality Highly original
AI Analysis

This addresses fairness concerns in language model training for users of LLMs, as low-frequency tokens represent the majority of token distributions in most languages.

The paper reveals that weight decay regularization in large language models introduces performance biases that disproportionately depreciate low-frequency tokens, with empirical evidence showing this effect across models from 270M to 3B parameters.

Large Language Models (LLMs) are widely deployed in real-world applications, yet little is known about their training dynamics at the token level. Evaluation typically relies on aggregated training loss, measured at the batch level, which overlooks subtle per-token biases arising from (i) varying token-level dynamics and (ii) structural biases introduced by hyperparameters. While weight decay is commonly used to stabilize training, we reveal that it silently introduces performance biases detectable only at the token level. In fact, we empirically show across different dataset sizes, model architectures and sizes ranging from 270M to 3B parameters that as weight decay increases, low-frequency tokens are disproportionately depreciated. This is particularly concerning, as these neglected low-frequency tokens represent the vast majority of the token distribution in most languages, calling for novel regularization techniques that ensure fairness across all available tokens.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes