CLAILGOct 16, 2024

Reverse-Engineering the Reader

arXiv:2410.13086v124 citationsh-index: 25EMNLP
Originality Incremental advance
AI Analysis

This addresses the challenge of creating more cognitively plausible AI models for researchers in psycholinguistics and cognitive science, though it is incremental as it builds on prior observations of trade-offs.

The paper tackles the problem of optimizing language models to better predict human reading times by aligning them with psychometric data, finding that this improves psychometric predictive power but inversely affects performance on NLP tasks and perplexity.

Numerous previous studies have sought to determine to what extent language models, pretrained on natural language text, can serve as useful models of human cognition. In this paper, we are interested in the opposite question: whether we can directly optimize a language model to be a useful cognitive model by aligning it to human psychometric data. To achieve this, we introduce a novel alignment technique in which we fine-tune a language model to implicitly optimize the parameters of a linear regressor that directly predicts humans' reading times of in-context linguistic units, e.g., phonemes, morphemes, or words, using surprisal estimates derived from the language model. Using words as a test case, we evaluate our technique across multiple model sizes and datasets and find that it improves language models' psychometric predictive power. However, we find an inverse relationship between psychometric power and a model's performance on downstream NLP tasks as well as its perplexity on held-out test data. While this latter trend has been observed before (Oh et al., 2022; Shain et al., 2024), we are the first to induce it by manipulating a model's alignment to psychometric data.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes