CLMay 6, 2022

When a sentence does not introduce a discourse entity, Transformer-based models still sometimes refer to it

arXiv:2205.03472v1637 citationsh-index: 40
AI Analysis

This addresses a problem for natural language understanding in AI, showing incremental limitations in large language models' ability to handle fundamental linguistic phenomena.

The study investigated whether Transformer-based models like GPT-2 and GPT-3 can track discourse entities when indefinite noun phrases are affected by sentential operators like negation, finding that the models are partially sensitive but challenged by multiple NPs and lack systematicity, indicating they do not fully acquire basic entity tracking abilities.

Understanding longer narratives or participating in conversations requires tracking of discourse entities that have been mentioned. Indefinite noun phrases (NPs), such as 'a dog', frequently introduce discourse entities but this behavior is modulated by sentential operators such as negation. For example, 'a dog' in 'Arthur doesn't own a dog' does not introduce a discourse entity due to the presence of negation. In this work, we adapt the psycholinguistic assessment of language models paradigm to higher-level linguistic phenomena and introduce an English evaluation suite that targets the knowledge of the interactions between sentential operators and indefinite NPs. We use this evaluation suite for a fine-grained investigation of the entity tracking abilities of the Transformer-based models GPT-2 and GPT-3. We find that while the models are to a certain extent sensitive to the interactions we investigate, they are all challenged by the presence of multiple NPs and their behavior is not systematic, which suggests that even models at the scale of GPT-3 do not fully acquire basic entity tracking abilities.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes