CL AINov 22, 2022

Coreference Resolution through a seq2seq Transition-Based System

Bernd Bohnet, Chris Alberti, Michael Collins

DeepMind

arXiv:2211.12142v122.7239 citationsh-index: 26Has Code

Originality Highly original

AI Analysis

This work addresses coreference resolution, a key NLP task for improving text understanding, by proposing a novel method that outperforms previous approaches across multiple languages.

The paper tackles coreference resolution by introducing a seq2seq transition-based system that jointly predicts mentions and links, achieving state-of-the-art accuracy with F1-scores of 83.3 for English, 68.5 for Arabic, and 74.3 for Chinese on CoNLL-2012 datasets.

Most recent coreference resolution systems use search algorithms over possible spans to identify mentions and resolve coreference. We instead present a coreference resolution system that uses a text-to-text (seq2seq) paradigm to predict mentions and links jointly. We implement the coreference system as a transition system and use multilingual T5 as an underlying language model. We obtain state-of-the-art accuracy on the CoNLL-2012 datasets with 83.3 F1-score for English (a 2.3 higher F1-score than previous work (Dobrovolskii, 2021)) using only CoNLL data for training, 68.5 F1-score for Arabic (+4.1 higher than previous work) and 74.3 F1-score for Chinese (+5.3). In addition we use the SemEval-2010 data sets for experiments in the zero-shot setting, a few-shot setting, and supervised setting using all available training data. We get substantially higher zero-shot F1-scores for 3 out of 4 languages than previous approaches and significantly exceed previous supervised state-of-the-art results for all five tested languages.

View on arXiv PDF Code

Similar