CLDec 19, 2020

Uncertainty-Aware Label Refinement for Sequence Labeling

arXiv:2012.10608v11000 citations
AI Analysis

This work addresses the limitations of local label dependencies and inefficient decoding in CRF-based sequence labeling for researchers and practitioners, offering a more efficient and accurate alternative.

This paper proposes a two-stage label decoding framework for sequence labeling that models long-term label dependencies and achieves faster inference than CRF-based methods. It uses a base model for draft predictions, refined by a two-stream self-attention model, and incorporates Bayesian neural networks to identify and mitigate incorrect draft labels.

Conditional random fields (CRF) for label decoding has become ubiquitous in sequence labeling tasks. However, the local label dependencies and inefficient Viterbi decoding have always been a problem to be solved. In this work, we introduce a novel two-stage label decoding framework to model long-term label dependencies, while being much more computationally efficient. A base model first predicts draft labels, and then a novel two-stream self-attention model makes refinements on these draft predictions based on long-range label dependencies, which can achieve parallel decoding for a faster prediction. In addition, in order to mitigate the side effects of incorrect draft labels, Bayesian neural networks are used to indicate the labels with a high probability of being wrong, which can greatly assist in preventing error propagation. The experimental results on three sequence labeling benchmarks demonstrated that the proposed method not only outperformed the CRF-based methods but also greatly accelerated the inference process.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes