CLMay 15, 2021

Lexicon Enhanced Chinese Sequence Labeling Using BERT Adapter

arXiv:2105.07148v3720 citations
Originality Incremental advance
AI Analysis

This work addresses the need for deeper lexicon fusion in Chinese NLP tasks, offering an incremental improvement over existing methods.

The paper tackled the problem of shallow integration of lexicon information in Chinese sequence labeling by proposing LEBERT, which integrates lexicon knowledge directly into BERT layers using a Lexicon Adapter, achieving state-of-the-art results on ten datasets across three tasks.

Lexicon information and pre-trained models, such as BERT, have been combined to explore Chinese sequence labelling tasks due to their respective strengths. However, existing methods solely fuse lexicon features via a shallow and random initialized sequence layer and do not integrate them into the bottom layers of BERT. In this paper, we propose Lexicon Enhanced BERT (LEBERT) for Chinese sequence labelling, which integrates external lexicon knowledge into BERT layers directly by a Lexicon Adapter layer. Compared with the existing methods, our model facilitates deep lexicon knowledge fusion at the lower layers of BERT. Experiments on ten Chinese datasets of three tasks including Named Entity Recognition, Word Segmentation, and Part-of-Speech tagging, show that LEBERT achieves the state-of-the-art results.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes