CLFeb 17, 2023

False perspectives on human language: why statistics needs linguistics

arXiv:2302.08822v19 citationsh-index: 31
Originality Incremental advance
AI Analysis

This resolves a foundational tension in linguistics and cognitive science, offering a unified perspective for researchers in these fields.

The paper tackles the debate between statistical and structural approaches to understanding human language, showing that only surprisal models incorporating syntactic structure can account for language regularities.

A sharp tension exists about the nature of human language between two opposite parties: those who believe that statistical surface distributions, in particular using measures like surprisal, provide a better understanding of language processing, vs. those who believe that discrete hierarchical structures implementing linguistic information such as syntactic ones are a better tool. In this paper, we show that this dichotomy is a false one. Relying on the fact that statistical measures can be defined on the basis of either structural or non-structural models, we provide empirical evidence that only models of surprisal that reflect syntactic structure are able to account for language regularities.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes