CLAILGNov 13, 2023

An Analysis and Mitigation of the Reversal Curse

arXiv:2311.07468v338 citationsh-index: 19
Originality Synthesis-oriented
AI Analysis

This identifies a specific limitation in LLMs that affects their ability to process inverse relations, which is incremental as it builds on prior observations of the phenomenon.

The paper investigates the 'reversal curse' in large language models, where models handle sequences like 'aRb' well but struggle with 'bR^{-1}a', and finds it stems from training objectives like next-token prediction.

Recent research observed a noteworthy phenomenon in large language models (LLMs), referred to as the ``reversal curse.'' The reversal curse is that when dealing with two entities, denoted as $a$ and $b$, connected by their relation $R$ and its inverse $R^{-1}$, LLMs excel in handling sequences in the form of ``$aRb$,'' but encounter challenges when processing ``$bR^{-1}a$,'' whether in generation or comprehension. For instance, GPT-4 can accurately respond to the query ``Tom Cruise's mother is?'' with ``Mary Lee Pfeiffer,'' but it struggles to provide a satisfactory answer when asked ``Mary Lee Pfeiffer's son is?'' In this paper, we undertake the first-ever study of how the reversal curse happens in LLMs. Our investigations reveal that the reversal curse can stem from the specific training objectives, which become particularly evident in the widespread use of next-token prediction within most causal language models. We hope this initial investigation can draw more attention to the reversal curse, as well as other underlying limitations in current LLMs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes