CLLGMay 6, 2020

PeTra: A Sparsely Supervised Memory Model for People Tracking

arXiv:2005.02990v1997 citations
Originality Incremental advance
AI Analysis

This work addresses people tracking in natural language processing, which is incremental as it builds on existing memory models with simplifications and new evaluations.

The authors tackled the problem of tracking entities in text with sparse supervision, proposing PeTra, a memory-augmented neural network that outperforms a prior model on pronoun resolution while using a simpler architecture.

We propose PeTra, a memory-augmented neural network designed to track entities in its memory slots. PeTra is trained using sparse annotation from the GAP pronoun resolution dataset and outperforms a prior memory model on the task while using a simpler architecture. We empirically compare key modeling choices, finding that we can simplify several aspects of the design of the memory module while retaining strong performance. To measure the people tracking capability of memory models, we (a) propose a new diagnostic evaluation based on counting the number of unique entities in text, and (b) conduct a small scale human evaluation to compare evidence of people tracking in the memory logs of PeTra relative to a previous approach. PeTra is highly effective in both evaluations, demonstrating its ability to track people in its memory despite being trained with limited annotation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes