HCAICLApr 4, 2024

Fakes of Varying Shades: How Warning Affects Human Perception and Engagement Regarding LLM Hallucinations

arXiv:2404.03745v327 citationsh-index: 7Has Code
Originality Synthesis-oriented
AI Analysis

This research addresses the problem of human detection of LLM hallucinations for users interacting with AI-generated content, though it is incremental in nature.

The study investigated how varying degrees of LLM hallucinations (genuine, minor, major) and warnings affect human perception and engagement, finding that participants ranked content truthfulness in that order and warnings improved hallucination detection without harming genuine content perception.

The widespread adoption and transformative effects of large language models (LLMs) have sparked concerns regarding their capacity to produce inaccurate and fictitious content, referred to as `hallucinations'. Given the potential risks associated with hallucinations, humans should be able to identify them. This research aims to understand the human perception of LLM hallucinations by systematically varying the degree of hallucination (genuine, minor hallucination, major hallucination) and examining its interaction with warning (i.e., a warning of potential inaccuracies: absent vs. present). Participants (N=419) from Prolific rated the perceived accuracy and engaged with content (e.g., like, dislike, share) in a Q/A format. Participants ranked content as truthful in the order of genuine, minor hallucination, and major hallucination, and user engagement behaviors mirrored this pattern. More importantly, we observed that warning improved the detection of hallucination without significantly affecting the perceived truthfulness of genuine content. We conclude by offering insights for future tools to aid human detection of hallucinations. All survey materials, demographic questions, and post-session questions are available at: https://github.com/MahjabinNahar/fakes-of-varying-shades-survey-materials

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes