CLJul 27, 2024

Investigating Critical Period Effects in Language Acquisition through Neural Language Models

Cambridge
arXiv:2407.19325v221 citationsh-index: 27
Originality Incremental advance
AI Analysis

This addresses the problem of understanding the origins of critical periods in language learning for cognitive science and AI, but it is incremental as it builds on existing LM research.

The study investigated whether critical period effects in language acquisition, where learning becomes harder after early childhood, are unique to humans or shared by language models, finding that LMs do not show these effects without additional engineering, contradicting claims that they arise from statistical learning alone.

Humans appear to have a critical period (CP) for language acquisition: Second language (L2) acquisition becomes harder after early childhood, and ceasing exposure to a first language (L1) after this period (but not before) typically does not lead to substantial loss of L1 proficiency. It is unknown whether these CP effects result from innately determined brain maturation or as a stabilization of neural connections naturally induced by experience. In this study, we use language models (LMs) to test the extent to which these phenomena are peculiar to humans, or shared by a broader class of language learners. We vary the age of exposure by training LMs on language pairs in various experimental conditions, and find that LMs, which lack any direct analog to innate maturational stages, do not show CP effects when the age of exposure of L2 is delayed. Our results contradict the claim that CP effects are an inevitable result of statistical learning, and they are consistent with an innate mechanism for CP effects. We show that we can reverse-engineer the CP by introducing a regularizer partway through training to simulate a maturational decrease in plasticity. All in all, our results suggest that L1 learning on its own may not be enough to induce a CP, and additional engineering is necessary to make language models more cognitively plausible.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes