CLLGAug 24, 2019

Position-Aware Self-Attention based Neural Sequence Labeling

arXiv:1908.09128v227 citations
AI Analysis

This addresses a limitation in RNN-based sequence labeling for NLP tasks, offering improved performance, though it appears incremental as it builds on existing attention mechanisms.

The paper tackled the problem of capturing non-continuous token relations in sequence labeling by proposing a position-aware self-attention model, which outperformed state-of-the-art methods on tasks like POS tagging, NER, and phrase chunking without external knowledge.

Sequence labeling is a fundamental task in natural language processing and has been widely studied. Recently, RNN-based sequence labeling models have increasingly gained attentions. Despite superior performance achieved by learning the long short-term (i.e., successive) dependencies, the way of sequentially processing inputs might limit the ability to capture the non-continuous relations over tokens within a sentence. To tackle the problem, we focus on how to effectively model successive and discrete dependencies of each token for enhancing the sequence labeling performance. Specifically, we propose an innovative attention-based model (called position-aware selfattention, i.e., PSA) as well as a well-designed self-attentional context fusion layer within a neural network architecture, to explore the positional information of an input sequence for capturing the latent relations among tokens. Extensive experiments on three classical tasks in sequence labeling domain, i.e., partof-speech (POS) tagging, named entity recognition (NER) and phrase chunking, demonstrate our proposed model outperforms the state-of-the-arts without any external knowledge, in terms of various metrics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes