SpanNER: Named Entity Re-/Recognition as Span Prediction
This work addresses the need for better NER systems by combining paradigms, but it is incremental as it builds on existing span prediction methods.
The paper tackles the problem of understanding and improving Named Entity Recognition (NER) by comparing span prediction and sequence labeling paradigms, and shows that span prediction models are effective as both base NER systems and combiners for re-recognizing entities from multiple systems, with experiments on 154 systems across 11 datasets in three languages.
Recent years have seen the paradigm shift of Named Entity Recognition (NER) systems from sequence labeling to span prediction. Despite its preliminary effectiveness, the span prediction model's architectural bias has not been fully understood. In this paper, we first investigate the strengths and weaknesses when the span prediction model is used for named entity recognition compared with the sequence labeling framework and how to further improve it, which motivates us to make complementary advantages of systems based on different paradigms. We then reveal that span prediction, simultaneously, can serve as a system combiner to re-recognize named entities from different systems' outputs. We experimentally implement 154 systems on 11 datasets, covering three languages, comprehensive results show the effectiveness of span prediction models that both serve as base NER systems and system combiners. We make all code and datasets available: \url{https://github.com/neulab/spanner}, as well as an online system demo: \url{http://spanner.sh}. Our model also has been deployed into the ExplainaBoard platform, which allows users to flexibly perform a system combination of top-scoring systems in an interactive way: \url{http://explainaboard.nlpedia.ai/leaderboard/task-ner/}.