CLAIOct 30, 2023

Large Language Models: The Need for Nuance in Current Debates and a Pragmatic Perspective on Understanding

arXiv:2310.19671v2140 citationsh-index: 5
Originality Synthesis-oriented
AI Analysis

It addresses philosophical debates about AI understanding for researchers and society, but is incremental as it refines existing critiques without new empirical results.

The paper critiques common arguments against LLM capacities, showing they lack nuance, and proposes a pragmatic framework for attributing understanding to LLMs based on utility rather than mental states.

Current Large Language Models (LLMs) are unparalleled in their ability to generate grammatically correct, fluent text. LLMs are appearing rapidly, and debates on LLM capacities have taken off, but reflection is lagging behind. Thus, in this position paper, we first zoom in on the debate and critically assess three points recurring in critiques of LLM capacities: i) that LLMs only parrot statistical patterns in the training data; ii) that LLMs master formal but not functional language competence; and iii) that language learning in LLMs cannot inform human language learning. Drawing on empirical and theoretical arguments, we show that these points need more nuance. Second, we outline a pragmatic perspective on the issue of `real' understanding and intentionality in LLMs. Understanding and intentionality pertain to unobservable mental states we attribute to other humans because they have pragmatic value: they allow us to abstract away from complex underlying mechanics and predict behaviour effectively. We reflect on the circumstances under which it would make sense for humans to similarly attribute mental states to LLMs, thereby outlining a pragmatic philosophical context for LLMs as an increasingly prominent technology in society.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes