CL AIOct 16, 2024

Kallini et al. (2024) do not compare impossible languages with constituency-based ones

arXiv:2410.12271v16 citationsh-index: 1

Originality Synthesis-oriented

AI Analysis

This addresses a methodological flaw in linguistic theory research for AI and cognitive science, but it is incremental as it focuses on correcting a specific experimental comparison.

The paper critiques Kallini et al. (2024) for a confound in their experiments comparing LLMs' learning of possible vs. impossible human languages, arguing that their conclusion about LLMs' inductive biases aligning with human language constraints is unwarranted.

A central goal of linguistic theory is to find a precise characterization of the notion "possible human language", in the form of a computational device that is capable of describing all and only the languages that can be acquired by a typically developing human child. The success of recent large language models (LLMs) in NLP applications arguably raises the possibility that LLMs might be computational devices that meet this goal. This would only be the case if, in addition to succeeding in learning human languages, LLMs struggle to learn "impossible" human languages. Kallini et al. (2024; "Mission: Impossible Language Models", Proc. ACL) conducted experiments aiming to test this by training GPT-2 on a variety of synthetic languages, and found that it learns some more successfully than others. They present these asymmetries as support for the idea that LLMs' inductive biases align with what is regarded as "possible" for human languages, but the most significant comparison has a confound that makes this conclusion unwarranted. In this paper I explain the confound and suggest some ways forward towards constructing a comparison that appropriately tests the underlying issue.

View on arXiv PDF

Similar