CLDec 5, 2022

Transformer-Based Named Entity Recognition for French Using Adversarial Adaptation to Similar Domain Corpora

arXiv:2212.03692v15 citationsh-index: 41
Originality Incremental advance
AI Analysis

This work addresses the problem of limited labelled data for French NER, offering a domain adaptation solution that is incremental in nature.

The paper tackled Named Entity Recognition for French, a low-resource language, by using adversarial adaptation to similar domain corpora, resulting in improved performance over non-adaptive models across multiple datasets and transformer combinations.

Named Entity Recognition (NER) involves the identification and classification of named entities in unstructured text into predefined classes. NER in languages with limited resources, like French, is still an open problem due to the lack of large, robust, labelled datasets. In this paper, we propose a transformer-based NER approach for French using adversarial adaptation to similar domain or general corpora for improved feature extraction and better generalization. We evaluate our approach on three labelled datasets and show that our adaptation framework outperforms the corresponding non-adaptive models for various combinations of transformer models, source datasets and target corpora.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes