AI | LG | Jan 27

Beyond In-Domain Detection: SpikeScore for Cross-Domain Hallucination Detection

arXiv:2601.19245v1 | h-index: 2
Originality: Incremental advance
AI Analysis

This work addresses a critical issue for deploying LLMs in real-world applications by improving hallucination detection across diverse domains, though the advance is incremental because it builds on existing detection methods.

The paper tackles the problem of poor cross-domain generalization in hallucination detection for large language models by proposing SpikeScore, a method that quantifies uncertainty fluctuations in multi-turn dialogues, and demonstrates it outperforms baselines in cross-domain settings.

Hallucination detection is critical for deploying large language models (LLMs) in real-world applications. Existing hallucination detection methods achieve strong performance when the training and test data come from the same domain, but they suffer from poor cross-domain generalization. In this paper, we study an important yet overlooked problem, termed generalizable hallucination detection (GHD), which aims to train hallucination detectors on data from a single domain while ensuring robust performance across diverse related domains. In studying GHD, we simulate multi-turn dialogues following the LLM's initial response and observe an interesting phenomenon: hallucination-initiated multi-turn dialogues universally exhibit larger uncertainty fluctuations than factual ones across different domains. Based on this phenomenon, we propose a new score, SpikeScore, which quantifies abrupt fluctuations in multi-turn dialogues. Through both theoretical analysis and empirical validation, we demonstrate that SpikeScore achieves strong cross-domain separability between hallucinated and non-hallucinated responses. Experiments across multiple LLMs and benchmarks demonstrate that the SpikeScore-based detection method outperforms representative baselines in cross-domain generalization and surpasses advanced generalization-oriented methods, verifying the effectiveness of our method in cross-domain hallucination detection.
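The abstract does not give the SpikeScore formula, so the following is only a rough illustration of the general idea of scoring abrupt uncertainty fluctuations across dialogue turns. It assumes one scalar uncertainty value per turn and uses the largest absolute turn-to-turn change as a stand-in score. The function names, the threshold, and the example values are all hypothetical and are not taken from the paper.

```python
from typing import Sequence


def spike_score_sketch(uncertainties: Sequence[float]) -> float:
    """Largest absolute turn-to-turn change in uncertainty.

    Illustrative stand-in only: the paper's actual SpikeScore
    definition is not stated in the abstract.
    """
    if len(uncertainties) < 2:
        # A single turn has no fluctuation to measure.
        return 0.0
    return max(abs(b - a) for a, b in zip(uncertainties, uncertainties[1:]))


def flag_hallucination(uncertainties: Sequence[float], threshold: float) -> bool:
    """Flag a dialogue whose score exceeds a (hypothetical) threshold."""
    return spike_score_sketch(uncertainties) > threshold


# Made-up per-turn uncertainty traces for two simulated dialogues.
stable_trace = [0.20, 0.22, 0.21, 0.23]  # small fluctuations
spiky_trace = [0.20, 0.65, 0.25, 0.70]   # abrupt fluctuations

print(flag_hallucination(stable_trace, threshold=0.3))
print(flag_hallucination(spiky_trace, threshold=0.3))
```

In this toy setup the score depends only on how sharply uncertainty moves between turns, not on its absolute level, which is one plausible reading of why a fluctuation-based signal might transfer across domains.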

