CLMay 2, 2018

Constituency Parsing with a Self-Attentive Encoder

arXiv:1805.01052v11295 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses parsing accuracy for natural language processing researchers, offering incremental improvements through architectural changes.

The paper tackled constituency parsing by replacing an LSTM encoder with a self-attentive architecture, achieving new state-of-the-art results of 93.55 F1 on the Penn Treebank without external data and 95.13 F1 with pre-trained word representations, and outperforming previous best accuracies on 8 of 9 languages in the SPMRL dataset.

We demonstrate that replacing an LSTM encoder with a self-attentive architecture can lead to improvements to a state-of-the-art discriminative constituency parser. The use of attention makes explicit the manner in which information is propagated between different locations in the sentence, which we use to both analyze our model and propose potential improvements. For example, we find that separating positional and content information in the encoder can lead to improved parsing accuracy. Additionally, we evaluate different approaches for lexical representation. Our parser achieves new state-of-the-art results for single models trained on the Penn Treebank: 93.55 F1 without the use of any external data, and 95.13 F1 when using pre-trained word representations. Our parser also outperforms the previous best-published accuracy figures on 8 of the 9 languages in the SPMRL dataset.

Code Implementations5 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes