CL AIMay 23, 2025

keepitsimple at SemEval-2025 Task 3: LLM-Uncertainty based Approach for Multilingual Hallucination Span Detection

Saketh Reddy Vemula, Parameswari Krishnamurthy

arXiv:2505.17485v11 citationsh-index: 5Has Code

Originality Incremental advance

AI Analysis

This addresses the need for cost-effective and adaptable hallucination detection in multilingual applications, though it appears incremental as it builds on existing uncertainty-based approaches.

The paper tackled the problem of identifying hallucination spans in black-box language model generated text by using an entropy-based analysis of stochastically-sampled responses, achieving accurate detection without additional training.

Identification of hallucination spans in black-box language model generated text is essential for applications in the real world. A recent attempt at this direction is SemEval-2025 Task 3, Mu-SHROOM-a Multilingual Shared Task on Hallucinations and Related Observable Over-generation Errors. In this work, we present our solution to this problem, which capitalizes on the variability of stochastically-sampled responses in order to identify hallucinated spans. Our hypothesis is that if a language model is certain of a fact, its sampled responses will be uniform, while hallucinated facts will yield different and conflicting results. We measure this divergence through entropy-based analysis, allowing for accurate identification of hallucinated segments. Our method is not dependent on additional training and hence is cost-effective and adaptable. In addition, we conduct extensive hyperparameter tuning and perform error analysis, giving us crucial insights into model behavior.

View on arXiv PDF Code

Similar