CLJun 15, 2022

Emergent Abilities of Large Language Models

arXiv:2206.07682v23581 citationsh-index: 102
AI Analysis

This addresses the problem of unpredictable model behavior for AI researchers and developers, highlighting a non-incremental phenomenon in scaling.

The paper identifies emergent abilities in large language models, where certain capabilities appear only in larger models and cannot be predicted from smaller ones, suggesting that scaling could expand model capabilities.

Scaling up language models has been shown to predictably improve performance and sample efficiency on a wide range of downstream tasks. This paper instead discusses an unpredictable phenomenon that we refer to as emergent abilities of large language models. We consider an ability to be emergent if it is not present in smaller models but is present in larger models. Thus, emergent abilities cannot be predicted simply by extrapolating the performance of smaller models. The existence of such emergence implies that additional scaling could further expand the range of capabilities of language models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes