CLAIOct 23, 2022

Improving Chinese Named Entity Recognition by Search Engine Augmentation

arXiv:2210.12662v12 citationsh-index: 7
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving NER accuracy in Chinese, particularly for rare and emerging entities, which is incremental as it builds on existing semantic augmentation methods.

The paper tackles the problem of Chinese named entity recognition (NER) by augmenting contextual information with external knowledge from search engines to address grammatical ambiguities and rare entities, achieving superior performance across four datasets.

Compared with English, Chinese suffers from more grammatical ambiguities, like fuzzy word boundaries and polysemous words. In this case, contextual information is not sufficient to support Chinese named entity recognition (NER), especially for rare and emerging named entities. Semantic augmentation using external knowledge is a potential way to alleviate this problem, while how to obtain and leverage external knowledge for the NER task remains a challenge. In this paper, we propose a neural-based approach to perform semantic augmentation using external knowledge from search engine for Chinese NER. In particular, a multi-channel semantic fusion model is adopted to generate the augmented input representations, which aggregates external related texts retrieved from the search engine. Experiments have shown the superiority of our model across 4 NER datasets, including formal and social media language contexts, which further prove the effectiveness of our approach.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes