CLLGFeb 5, 2019

The Referential Reader: A Recurrent Entity Network for Anaphora Resolution

arXiv:1902.01541v21103 citations
AI Analysis

This work addresses anaphora resolution for natural language processing applications, representing an incremental improvement with a novel method for a known bottleneck.

The authors tackled the problem of anaphora resolution in online text processing by introducing a recurrent entity network architecture that stores and accesses entity mentions through differentiable gates, achieving strong performance on a pronoun-name anaphora dataset with incremental processing.

We present a new architecture for storing and accessing entity mentions during online text processing. While reading the text, entity references are identified, and may be stored by either updating or overwriting a cell in a fixed-length memory. The update operation implies coreference with the other mentions that are stored in the same cell; the overwrite operation causes these mentions to be forgotten. By encoding the memory operations as differentiable gates, it is possible to train the model end-to-end, using both a supervised anaphora resolution objective as well as a supplementary language modeling objective. Evaluation on a dataset of pronoun-name anaphora demonstrates strong performance with purely incremental text processing.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes