Is it Cake or is it AI? A Systematic Review of Human Uncertainty in Distinguishing Generative Artificial Intelligence Content
For researchers and practitioners concerned with content authenticity and trust, this review synthesizes evidence that humans cannot reliably detect AI-generated content, questioning the importance of this ability.
This systematic review of 30 studies found that human ability to distinguish generative AI content from human-produced content across text, image, and voice modalities generally clusters around chance performance, indicating humans are unreliable detectors.
This systematic review synthesized empirical evidence on human ability to distinguish generative artificial intelligence content from human produced content across text, image, and voice modalities. A structured search of Scopus identified 22,541 records from 2025 to 2026, of which 1200 were screened and 30 studies were included. Across these studies, human detection accuracy varied widely but generally clustered around chance performance. Overall, the literature shows that humans are generally unreliable detectors of gen AI content, raising broader questions about whether the ability to tell should matter for how we evaluate or trust content.