CLAILGNCNov 15, 2023

Divergences between Language Models and Human Brains

CMU
arXiv:2311.09308v37 citationsh-index: 91
Originality Incremental advance
AI Analysis

This work addresses the problem of understanding differences in language processing between machines and humans, which is incremental as it builds on prior findings of similarities to highlight specific gaps.

The paper systematically explores divergences between language models (LMs) and human brains in language processing, identifying that LMs poorly capture social/emotional intelligence and physical commonsense, and shows that fine-tuning LMs on these domains improves alignment with human brain responses.

Do machines and humans process language in similar ways? Recent research has hinted at the affirmative, showing that human neural activity can be effectively predicted using the internal representations of language models (LMs). Although such results are thought to reflect shared computational principles between LMs and human brains, there are also clear differences in how LMs and humans represent and use language. In this work, we systematically explore the divergences between human and machine language processing by examining the differences between LM representations and human brain responses to language as measured by Magnetoencephalography (MEG) across two datasets in which subjects read and listened to narrative stories. Using an LLM-based data-driven approach, we identify two domains that LMs do not capture well: social/emotional intelligence and physical commonsense. We validate these findings with human behavioral experiments and hypothesize that the gap is due to insufficient representations of social/emotional and physical knowledge in LMs. Our results show that fine-tuning LMs on these domains can improve their alignment with human brain responses.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes