CLNov 2, 2020

Sequence-to-Sequence Networks Learn the Meaning of Reflexive Anaphora

arXiv:2011.00682v1990 citations
AI Analysis

This addresses a specific problem in natural language processing for linguists and AI researchers, but it is incremental as it builds on past doubts about recurrent networks.

The paper tackles the challenge of whether sequence-to-sequence networks can learn the semantic interpretation of reflexive anaphora, which vary with context, and shows that these networks can learn and generalize this interpretation to novel antecedents.

Reflexive anaphora present a challenge for semantic interpretation: their meaning varies depending on context in a way that appears to require abstract variables. Past work has raised doubts about the ability of recurrent networks to meet this challenge. In this paper, we explore this question in the context of a fragment of English that incorporates the relevant sort of contextual variability. We consider sequence-to-sequence architectures with recurrent units and show that such networks are capable of learning semantic interpretations for reflexive anaphora which generalize to novel antecedents. We explore the effect of attention mechanisms and different recurrent unit types on the type of training data that is needed for success as measured in two ways: how much lexical support is needed to induce an abstract reflexive meaning (i.e., how many distinct reflexive antecedents must occur during training) and what contexts must a noun phrase occur in to support generalization of reflexive interpretation to this noun phrase?

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes