CLAIMar 21, 2022

Leveraging Expert Guided Adversarial Augmentation For Improving Generalization in Named Entity Recognition

Georgia Tech
arXiv:2203.10693v1639 citationsh-index: 34Has Code
Originality Incremental advance
AI Analysis

This addresses generalization issues in NER for NLP practitioners, but it is incremental as it builds on existing adversarial and augmentation techniques.

The paper tackled the problem of poor generalization in Named Entity Recognition (NER) models on distribution-shifted data by proposing expert-guided adversarial augmentation, which improved performance on a challenging test set and out-of-domain data, with significant gains reported.

Named Entity Recognition (NER) systems often demonstrate great performance on in-distribution data, but perform poorly on examples drawn from a shifted distribution. One way to evaluate the generalization ability of NER models is to use adversarial examples, on which the specific variations associated with named entities are rarely considered. To this end, we propose leveraging expert-guided heuristics to change the entity tokens and their surrounding contexts thereby altering their entity types as adversarial attacks. Using expert-guided heuristics, we augmented the CoNLL 2003 test set and manually annotated it to construct a high-quality challenging set. We found that state-of-the-art NER systems trained on CoNLL 2003 training data drop performance dramatically on our challenging set. By training on adversarial augmented training examples and using mixup for regularization, we were able to significantly improve the performance on the challenging set as well as improve out-of-domain generalization which we evaluated by using OntoNotes data. We have publicly released our dataset and code at https://github.com/GT-SALT/Guided-Adversarial-Augmentation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes