LGCLMLMar 4, 2016

End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF

arXiv:1603.01354v52795 citations
Originality Highly original
AI Analysis

This addresses the need for automated, feature-free sequence labeling systems for NLP researchers and practitioners, representing a novel method rather than an incremental improvement.

The paper tackles the problem of sequence labeling tasks like part-of-speech tagging and named entity recognition by introducing an end-to-end neural network architecture combining bidirectional LSTM, CNN, and CRF, achieving state-of-the-art results of 97.55% accuracy and 91.21% F1 score.

State-of-the-art sequence labeling systems traditionally require large amounts of task-specific knowledge in the form of hand-crafted features and data pre-processing. In this paper, we introduce a novel neutral network architecture that benefits from both word- and character-level representations automatically, by using combination of bidirectional LSTM, CNN and CRF. Our system is truly end-to-end, requiring no feature engineering or data pre-processing, thus making it applicable to a wide range of sequence labeling tasks. We evaluate our system on two data sets for two sequence labeling tasks --- Penn Treebank WSJ corpus for part-of-speech (POS) tagging and CoNLL 2003 corpus for named entity recognition (NER). We obtain state-of-the-art performance on both the two data --- 97.55\% accuracy for POS tagging and 91.21\% F1 for NER.

Code Implementations25 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes