HCAILGApr 1, 2025

Catch Me if You Search: When Contextual Web Search Results Affect the Detection of Hallucinations

arXiv:2504.01153v44 citationsh-index: 3Comput Hum Behav
AI Analysis

This addresses the problem of improving hallucination detection for users of LLMs with integrated web search, though it is incremental as it builds on existing verification methods.

The study investigated how providing static or dynamic web search results affects people's ability to detect hallucinations in LLM-generated content, finding that both conditions reduced perceived accuracy of hallucinations and negative LLM perceptions, but dynamic searches increased accuracy ratings for genuine content and self-confidence.

While we increasingly rely on large language models (LLMs) for various tasks, these models are known to produce inaccurate content or 'hallucinations' with potentially disastrous consequences. The recent integration of web search results into LLMs prompts the question of whether people utilize them to verify the generated content, thereby accurately detecting hallucinations. An online experiment (N=560) investigated how the provision of search results, either static (i.e., fixed search results provided by LLM) or dynamic (i.e., participant-led searches), affects participants' perceived accuracy of LLM-generated content (i.e., genuine, minor hallucination, major hallucination), self-confidence in accuracy ratings, as well as their overall evaluation of the LLM, as compared to the control condition (i.e., no search results). Results showed that participants in both static and dynamic conditions (vs. control) rated hallucinated content to be less accurate and perceived the LLM more negatively. However, those in the dynamic condition rated genuine content as more accurate and demonstrated greater overall self-confidence in their assessments than those in the static search or control conditions. We highlighted practical implications of incorporating web search functionality into LLMs in real-world contexts.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes