CLNov 26, 2018

Combining neural and knowledge-based approaches to Named Entity Recognition in Polish

arXiv:1811.10418v13 citations
Originality Incremental advance
AI Analysis

This work addresses NER for Polish, a domain-specific language task, by integrating external knowledge, showing incremental improvement over existing methods.

The authors tackled Named Entity Recognition (NER) for Polish by combining neural models with knowledge-based features and entity linking to Wikipedia, achieving state-of-the-art results with a 22.4% reduction in error rate compared to the winning solution on the PolEval 2018 challenge.

Named entity recognition (NER) is one of the tasks in natural language processing that can greatly benefit from the use of external knowledge sources. We propose a named entity recognition framework composed of knowledge-based feature extractors and a deep learning model including contextual word embeddings, long short-term memory (LSTM) layers and conditional random fields (CRF) inference layer. We use an entity linking module to integrate our system with Wikipedia. The combination of effective neural architecture and external resources allows us to obtain state-of-the-art results on recognition of Polish proper names. We evaluate our model on data from PolEval 2018 NER challenge on which it outperforms other methods, reducing the error rate by 22.4% compared to the winning solution. Our work shows that combining neural NER model and entity linking model with a knowledge base is more effective in recognizing named entities than using NER model alone.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes