CLApr 28, 2020

Active Learning for Coreference Resolution using Discrete Annotation

arXiv:2004.13671v31008 citationsHas Code
AI Analysis

This work addresses annotation efficiency for coreference resolution, offering a practical improvement for developing models in new domains.

The paper tackles the problem of inefficient annotation in coreference resolution by modifying active learning to include identifying antecedents for non-coreferent mention pairs, resulting in significant performance gains per annotation hour as demonstrated on benchmark datasets.

We improve upon pairwise annotation for active learning in coreference resolution, by asking annotators to identify mention antecedents if a presented mention pair is deemed not coreferent. This simple modification, when combined with a novel mention clustering algorithm for selecting which examples to label, is much more efficient in terms of the performance obtained per annotation budget. In experiments with existing benchmark coreference datasets, we show that the signal from this additional question leads to significant performance gains per human-annotation hour. Future work can use our annotation protocol to effectively develop coreference models for new domains. Our code is publicly available at https://github.com/belindal/discrete-active-learning-coref .

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes