CLMay 26, 2021

BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition

arXiv:2105.12848v2714 citations
Originality Highly original
AI Analysis

This addresses the problem of obtaining accurate NER models with cheap but noisy labels for researchers and practitioners, representing a strong specific gain rather than a foundational advancement.

The paper tackles learning named entity recognition (NER) from noisy multi-source weak supervision by proposing a conditional hidden Markov model (CHMM) enhanced with BERT embeddings, which outperforms state-of-the-art weakly supervised NER models on four benchmarks.

We study the problem of learning a named entity recognition (NER) tagger using noisy labels from multiple weak supervision sources. Though cheap to obtain, the labels from weak supervision sources are often incomplete, inaccurate, and contradictory, making it difficult to learn an accurate NER model. To address this challenge, we propose a conditional hidden Markov model (CHMM), which can effectively infer true labels from multi-source noisy labels in an unsupervised way. CHMM enhances the classic hidden Markov model with the contextual representation power of pre-trained language models. Specifically, CHMM learns token-wise transition and emission probabilities from the BERT embeddings of the input tokens to infer the latent true labels from noisy observations. We further refine CHMM with an alternate-training approach (CHMM-ALT). It fine-tunes a BERT-NER model with the labels inferred by CHMM, and this BERT-NER's output is regarded as an additional weak source to train the CHMM in return. Experiments on four NER benchmarks from various domains show that our method outperforms state-of-the-art weakly supervised NER models by wide margins.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes