CLApr 17, 2021

Moving on from OntoNotes: Coreference Resolution Model Transfer

arXiv:2104.08457v2664 citations
Originality Incremental advance
AI Analysis

This addresses the problem of domain mismatch in coreference resolution for NLP practitioners, though it is incremental as it builds on existing transfer methods.

The study quantified how well coreference resolution models trained on OntoNotes transfer to other datasets, finding that continued training is effective, especially with few target documents, and achieved state-of-the-art results on PreCo.

Academic neural models for coreference resolution (coref) are typically trained on a single dataset, OntoNotes, and model improvements are benchmarked on that same dataset. However, real-world applications of coref depend on the annotation guidelines and the domain of the target dataset, which often differ from those of OntoNotes. We aim to quantify transferability of coref models based on the number of annotated documents available in the target dataset. We examine eleven target datasets and find that continued training is consistently effective and especially beneficial when there are few target documents. We establish new benchmarks across several datasets, including state-of-the-art results on PreCo.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes