CL, Apr 12, 2021

Multilingual Language Models Predict Human Reading Behavior

arXiv:2104.05433v1727 citations
Originality: Synthesis-oriented
AI Analysis

This work offers insight into how AI models mimic human language comprehension, with potential applications in cognitive science and NLP, though it is incremental in that it applies existing models to new data.

The study investigated whether large language models can predict human reading behavior across multiple languages. It found that BERT and XLM models accurately predict eye-tracking features, which suggests that they encode relative importance in language in a way comparable to human processing.

We analyze if large language models are able to predict patterns of human reading behavior. We compare the performance of language-specific and multilingual pretrained transformer models to predict reading time measures reflecting natural human sentence processing on Dutch, English, German, and Russian texts. This results in accurate models of human reading behavior, which indicates that transformer models implicitly encode relative importance in language in a way that is comparable to human processing mechanisms. We find that BERT and XLM models successfully predict a range of eye tracking features. In a series of experiments, we analyze the cross-domain and cross-language abilities of these models and show how they reflect human sentence processing.

Code Implementations: 1 repo
