CLAIJun 25, 2024

Banishing LLM Hallucinations Requires Rethinking Generalization

arXiv:2406.17642v223 citations
Originality Highly original
AI Analysis

This addresses the issue of hallucinations in LLMs for users relying on accurate outputs, presenting a novel method rather than an incremental improvement.

The paper tackles the problem of LLM hallucinations by demonstrating that traditional approaches fail to explain them, showing that LLMs can memorize random data and hallucinate when training loss is above a threshold, and introduces Lamini-1 as a model that stores facts in a mixture of memory experts to remove hallucinations.

Despite their powerful chat, coding, and reasoning abilities, Large Language Models (LLMs) frequently hallucinate. Conventional wisdom suggests that hallucinations are a consequence of a balance between creativity and factuality, which can be mitigated, but not eliminated, by grounding the LLM in external knowledge sources. Through extensive systematic experiments, we show that these traditional approaches fail to explain why LLMs hallucinate in practice. Specifically, we show that LLMs augmented with a massive Mixture of Memory Experts (MoME) can easily memorize large datasets of random numbers. We corroborate these experimental findings with a theoretical construction showing that simple neural networks trained to predict the next token hallucinate when the training loss is above a threshold as it usually does in practice when training on internet scale data. We interpret our findings by comparing against traditional retrieval methods for mitigating hallucinations. We use our findings to design a first generation model for removing hallucinations -- Lamini-1 -- that stores facts in a massive mixture of millions of memory experts that are retrieved dynamically.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes