LG | Apr 14

Analyzing the Effect of Noise in LLM Fine-tuning

arXiv:2604.12469 | 45.4 | h-index: 2

AI Analysis

For practitioners fine-tuning LLMs, this work provides insights into which noise types are most harmful and how noise propagates through the network, though the findings are incremental and confirm expected trends.

This paper systematically studies how different types of noise (label, grammatical, typographical) affect LLM fine-tuning across three model families and three tasks, finding that label noise causes the largest performance degradation while other noise types can sometimes provide mild regularization benefits.

Fine-tuning is the dominant paradigm for adapting pretrained large language models (LLMs) to downstream NLP tasks. In practice, fine-tuning datasets may contain various forms of noise arising from annotation errors, preprocessing artifacts, or automated data collection. While prior work has focused on designing robust learning algorithms to mitigate performance degradation under noisy conditions, comparatively little is known about how different types of noise affect the internal learning dynamics of LLMs during fine-tuning. In this work, we systematically study the impact of noise on model behavior across three pretrained model families (GPT-2, Qwen2, and Llama-2) and three diverse NLP tasks. We introduce controlled perturbations corresponding to three common real-world noise types: label noise, grammatical noise, and typographical noise. Beyond task-level performance, we analyze layer-wise representation changes and attention patterns to understand how noise propagates through the network. Our results show that corrupting labels (i.e., label noise) consistently causes the largest performance degradation, whereas grammatical noise and typographical noise can occasionally yield mild regularization benefits. We further find that noise effects are localized primarily to task-specific layers, while attention structures remain comparatively stable.
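To make the idea of controlled perturbations concrete, here is a minimal sketch of how label noise and typographical noise might be injected into a fine-tuning dataset at a chosen rate. The abstract does not describe the paper's actual perturbation procedure, so the function names (`add_label_noise`, `add_typo_noise`), the uniform label flipping, and the adjacent-letter swap are all assumptions made for illustration. Grammatical noise is omitted because it would need a linguistic rule set that is not specified here.

```python
import random


def add_label_noise(labels, num_classes, rate, seed=0):
    """Hypothetical helper: flip a fraction `rate` of labels.

    Each selected label is replaced by a different class drawn uniformly,
    so exactly round(rate * len(labels)) entries change.
    """
    rng = random.Random(seed)
    noisy = list(labels)
    n_flip = round(rate * len(noisy))
    for i in rng.sample(range(len(noisy)), n_flip):
        alternatives = [c for c in range(num_classes) if c != noisy[i]]
        noisy[i] = rng.choice(alternatives)
    return noisy


def add_typo_noise(text, rate, seed=0):
    """Hypothetical helper: swap adjacent letters with probability `rate`.

    Only pairs of alphabetic characters are swapped, and a swapped pair is
    skipped over so that no character moves more than one position.
    """
    rng = random.Random(seed)
    chars = list(text)
    i = 0
    while i < len(chars) - 1:
        both_letters = chars[i].isalpha() and chars[i + 1].isalpha()
        if both_letters and rng.random() < rate:
            chars[i], chars[i + 1] = chars[i + 1], chars[i]
            i += 2
        else:
            i += 1
    return "".join(chars)


if __name__ == "__main__":
    clean_labels = [0, 1, 2] * 10
    noisy_labels = add_label_noise(clean_labels, num_classes=3, rate=0.2)
    changed = sum(a != b for a, b in zip(clean_labels, noisy_labels))
    print(f"{changed} of {len(clean_labels)} labels flipped")
    print(add_typo_noise("fine-tuning with noisy data", rate=0.3))
```

Fixing the seed keeps each noise condition reproducible, so the same corrupted dataset could be reused across model families when comparing how performance degrades.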
