CLJul 17, 2018

Improving Named Entity Recognition by Jointly Learning to Disambiguate Morphological Tags

arXiv:1807.06683v11095 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of NER for languages lacking external morphological disambiguation tools, though it is incremental as it builds on prior work using linguistic features.

The paper tackles the problem of named entity recognition (NER) in morphologically rich languages by proposing a model that jointly learns NER and morphological disambiguation (MD) to eliminate the need for external MD tools, showing that joint learning improves NER performance and achieves competitive MD results.

Previous studies have shown that linguistic features of a word such as possession, genitive or other grammatical cases can be employed in word representations of a named entity recognition (NER) tagger to improve the performance for morphologically rich languages. However, these taggers require external morphological disambiguation (MD) tools to function which are hard to obtain or non-existent for many languages. In this work, we propose a model which alleviates the need for such disambiguators by jointly learning NER and MD taggers in languages for which one can provide a list of candidate morphological analyses. We show that this can be done independent of the morphological annotation schemes, which differ among languages. Our experiments employing three different model architectures that join these two tasks show that joint learning improves NER performance. Furthermore, the morphological disambiguator's performance is shown to be competitive.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes