CLAILGMay 24, 2025

Robustness in Large Language Models: A Survey of Mitigation Strategies and Evaluation Metrics

arXiv:2505.18658v213 citationsh-index: 1Trans. Mach. Learn. Res.
Originality Synthesis-oriented
AI Analysis

It addresses the critical challenge of robustness in LLMs for researchers and practitioners in NLP and AI, but it is incremental as it synthesizes existing knowledge without introducing new methods or results.

This survey tackles the problem of ensuring robustness in Large Language Models by providing a comprehensive overview of current studies, including examining conceptual foundations, analyzing sources of non-robustness, reviewing mitigation strategies, and discussing evaluation metrics and gaps.

Large Language Models (LLMs) have emerged as a promising cornerstone for the development of natural language processing (NLP) and artificial intelligence (AI). However, ensuring the robustness of LLMs remains a critical challenge. To address these challenges and advance the field, this survey provides a comprehensive overview of current studies in this area. First, we systematically examine the nature of robustness in LLMs, including its conceptual foundations, the importance of consistent performance across diverse inputs, and the implications of failure modes in real-world applications. Next, we analyze the sources of non-robustness, categorizing intrinsic model limitations, data-driven vulnerabilities, and external adversarial factors that compromise reliability. Following this, we review state-of-the-art mitigation strategies, and then we discuss widely adopted benchmarks, emerging metrics, and persistent gaps in assessing real-world reliability. Finally, we synthesize findings from existing surveys and interdisciplinary studies to highlight trends, unresolved issues, and pathways for future research.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes