CLAIDec 13, 2022

A fine-grained comparison of pragmatic language understanding in humans and language models

arXiv:2212.06801v2252 citationsh-index: 62
Originality Incremental advance
AI Analysis

This work addresses the challenge of pragmatic language understanding in AI, providing insights for improving human-like communication in language models.

The study compared human and language model performance on seven pragmatic language tasks, finding that the largest models achieved high accuracy and matched human error patterns, but struggled with social expectation violations.

Pragmatics and non-literal language understanding are essential to human communication, and present a long-standing challenge for artificial language models. We perform a fine-grained comparison of language models and humans on seven pragmatic phenomena, using zero-shot prompting on an expert-curated set of English materials. We ask whether models (1) select pragmatic interpretations of speaker utterances, (2) make similar error patterns as humans, and (3) use similar linguistic cues as humans to solve the tasks. We find that the largest models achieve high accuracy and match human error patterns: within incorrect responses, models favor literal interpretations over heuristic-based distractors. We also find preliminary evidence that models and humans are sensitive to similar linguistic cues. Our results suggest that pragmatic behaviors can emerge in models without explicitly constructed representations of mental states. However, models tend to struggle with phenomena relying on social expectation violations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes