CLSep 30, 2023

Understanding In-Context Learning from Repetitions

TencentTsinghua
arXiv:2310.00297v335 citationsh-index: 13
Originality Incremental advance
AI Analysis

This provides insights into a fundamental AI capability, but it is incremental as it builds on existing understanding of in-context learning.

The paper tackles the mechanism of in-context learning in Large Language Models by examining surface repetitions, identifying token co-occurrence reinforcement as a key principle that strengthens token relationships based on contextual co-occurrences, and explaining reasons for its failures.

This paper explores the elusive mechanism underpinning in-context learning in Large Language Models (LLMs). Our work provides a novel perspective by examining in-context learning via the lens of surface repetitions. We quantitatively investigate the role of surface features in text generation, and empirically establish the existence of \emph{token co-occurrence reinforcement}, a principle that strengthens the relationship between two tokens based on their contextual co-occurrences. By investigating the dual impacts of these features, our research illuminates the internal workings of in-context learning and expounds on the reasons for its failures. This paper provides an essential contribution to the understanding of in-context learning and its potential limitations, providing a fresh perspective on this exciting capability.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes