CL AI CY LGSep 3, 2023

Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

Yue Zhang, Yafu Li, Leyang Cui, Deng Cai, Lemao Liu, Tingchen Fu, Xinting Huang, Enbo Zhao, Yu Zhang, Chen Xu, Yulong Chen, Longyue Wang

arXiv:2309.01219v338.01034 citationsHas Code

Originality Synthesis-oriented

AI Analysis

It addresses reliability issues for users of LLMs in real-world applications, but is incremental as it synthesizes existing research rather than introducing new methods.

This paper surveys the problem of hallucinations in large language models (LLMs), where models generate inaccurate or inconsistent content, and reviews recent efforts on detection, explanation, and mitigation strategies.

While large language models (LLMs) have demonstrated remarkable capabilities across a range of downstream tasks, a significant concern revolves around their propensity to exhibit hallucinations: LLMs occasionally generate content that diverges from the user input, contradicts previously generated context, or misaligns with established world knowledge. This phenomenon poses a substantial challenge to the reliability of LLMs in real-world scenarios. In this paper, we survey recent efforts on the detection, explanation, and mitigation of hallucination, with an emphasis on the unique challenges posed by LLMs. We present taxonomies of the LLM hallucination phenomena and evaluation benchmarks, analyze existing approaches aiming at mitigating LLM hallucination, and discuss potential directions for future research.

View on arXiv PDF Code

Similar