CLJul 23, 2024

Shared Imagination: LLMs Hallucinate Alike

arXiv:2407.16604v15 citationsh-index: 27
Originality Incremental advance
AI Analysis

This addresses the issue of model homogeneity and hallucination in LLMs for AI researchers, though it is incremental as it builds on existing understanding of model behavior.

The paper tackles the problem of understanding similarity among large language models (LLMs) by introducing imaginary question answering (IQA), where models generate and answer fictional questions; the result shows that models can answer each other's imaginary questions with remarkable success, indicating a shared imagination space.

Despite the recent proliferation of large language models (LLMs), their training recipes -- model architecture, pre-training data and optimization algorithm -- are often very similar. This naturally raises the question of the similarity among the resulting models. In this paper, we propose a novel setting, imaginary question answering (IQA), to better understand model similarity. In IQA, we ask one model to generate purely imaginary questions (e.g., on completely made-up concepts in physics) and prompt another model to answer. Surprisingly, despite the total fictionality of these questions, all models can answer each other's questions with remarkable success, suggesting a "shared imagination space" in which these models operate during such hallucinations. We conduct a series of investigations into this phenomenon and discuss implications on model homogeneity, hallucination, and computational creativity.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes