CL · Feb 28, 2023

Information-Restricted Neural Language Models Reveal Different Brain Regions' Sensitivity to Semantics, Syntax and Context

Alexandre Pasquiou, Yair Lakretz, Bertrand Thirion, Christophe Pallier

arXiv:2302.14389v13.924 citations · h-index: 52 · Has Code

Originality: Incremental advance

AI Analysis

This research addresses a fundamental question in neurolinguistics about the spatial organization of language processing in the brain, providing insights into how syntactic and semantic information are integrated across hemispheres.

The study investigated how brain regions process syntax and semantics during speech comprehension by training language models with restricted information and comparing their predictions to fMRI data from humans listening to text. Results showed that most language regions are sensitive to both syntax and semantics with varying magnitudes, revealing hemispheric asymmetries in processing and context sensitivity.

A fundamental question in neurolinguistics concerns the brain regions involved in syntactic and semantic processing during speech comprehension, both at the lexical (word processing) and supra-lexical levels (sentence and discourse processing). To what extent are these regions separated or intertwined? To address this question, we trained a lexical language model, GloVe, and a supra-lexical language model, GPT-2, on a text corpus from which we selectively removed either syntactic or semantic information. We then assessed to what extent these information-restricted models were able to predict the time courses of the fMRI signal of humans listening to naturalistic text. We also manipulated the size of contextual information provided to GPT-2 in order to determine the windows of integration of brain regions involved in supra-lexical processing. Our analyses show that, while most brain regions involved in language are sensitive to both syntactic and semantic variables, the relative magnitudes of these effects vary considerably across these regions. Furthermore, we found an asymmetry between the left and right hemispheres, with semantic and syntactic processing being more dissociated in the left hemisphere than in the right, and the left and right hemispheres showing respectively greater sensitivity to short and long contexts. The use of information-restricted NLP models thus sheds new light on the spatial organization of syntactic processing, semantic processing and compositionality.
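The comparison step described in the abstract (how well a model's features predict fMRI time courses) can be illustrated with a linear encoding model. The sketch below is not the authors' pipeline: it assumes a plain ridge regression with a fixed penalty, a single train/test split, and synthetic data. The names `encoding_score`, `full_features`, and `restricted_features` are hypothetical, and the "restricted" features are simply a subset of columns standing in for a model trained on information-restricted text. Steps a real analysis would need, such as accounting for the hemodynamic response and selecting the penalty by cross-validation, are omitted.

```python
import numpy as np


def ridge_fit(X, Y, alpha):
    """Closed-form ridge solution W = (X^T X + alpha * I)^-1 X^T Y."""
    n_features = X.shape[1]
    return np.linalg.solve(X.T @ X + alpha * np.eye(n_features), X.T @ Y)


def columnwise_pearson(A, B):
    """Pearson correlation between matching columns of A and B."""
    A = A - A.mean(axis=0)
    B = B - B.mean(axis=0)
    denom = np.linalg.norm(A, axis=0) * np.linalg.norm(B, axis=0)
    return (A * B).sum(axis=0) / denom


def encoding_score(features, bold, alpha=1.0, train_fraction=0.8):
    """Fit on the first part of the run, score each voxel on the held-out rest."""
    n_train = int(train_fraction * len(features))
    X_train, X_test = features[:n_train], features[n_train:]
    Y_train, Y_test = bold[:n_train], bold[n_train:]

    # Standardize features and center targets using training statistics only.
    mu = X_train.mean(axis=0)
    sd = X_train.std(axis=0) + 1e-8
    y_mean = Y_train.mean(axis=0)

    W = ridge_fit((X_train - mu) / sd, Y_train - y_mean, alpha)
    predicted = ((X_test - mu) / sd) @ W + y_mean
    return columnwise_pearson(predicted, Y_test)


# Synthetic stand-in data: 400 scans, 10 feature dimensions, 5 voxels.
rng = np.random.default_rng(0)
n_scans, n_features, n_voxels = 400, 10, 5
full_features = rng.standard_normal((n_scans, n_features))
true_weights = rng.standard_normal((n_features, n_voxels))
bold = full_features @ true_weights + 0.5 * rng.standard_normal((n_scans, n_voxels))

# Hypothetical "information-restricted" features: only half of the dimensions.
restricted_features = full_features[:, :5]

scores_full = encoding_score(full_features, bold)
scores_restricted = encoding_score(restricted_features, bold)
print("mean r, full features:      ", scores_full.mean())
print("mean r, restricted features:", scores_restricted.mean())
```

Comparing per-voxel scores between feature sets is one way to ask which regions depend on the information that was removed; the paper's own statistical procedure may differ.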
