CLAILGOct 22, 2024

Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination

MILA
arXiv:2410.17477v65 citationsh-index: 21ACL
Originality Incremental advance
AI Analysis

This work addresses the reliability and computational limitations of LLMs for users and developers, though it is incremental as it builds on existing concerns about hallucinations and architectural alternatives.

The study investigated how different model architectures, particularly recurrent models versus self-attention-based LLMs, affect the propensity and types of hallucinations in large language models, finding that while hallucinations are not limited to specific architectures, their occurrence and inducibility vary significantly based on architectural inductive biases.

The growth in prominence of large language models (LLMs) in everyday life can be largely attributed to their generative abilities, yet some of this is also owed to the risks and costs associated with their use. On one front is their tendency to hallucinate false or misleading information, limiting their reliability. On another is the increasing focus on the computational limitations associated with traditional self-attention based LLMs, which has brought about new alternatives, in particular recurrent models, meant to overcome them. Yet it remains uncommon to consider these two concerns simultaneously. Do changes in architecture exacerbate/alleviate existing concerns about hallucinations? Do they affect how and where they occur? Through an extensive evaluation, we study how these architecture-based inductive biases affect the propensity to hallucinate. While hallucination remains a general phenomenon not limited to specific architectures, the situations in which they occur and the ease with which specific types of hallucinations can be induced can significantly differ based on the model architecture. These findings highlight the need for better understanding both these problems in conjunction with each other, as well as consider how to design more universal techniques for handling hallucinations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes