LG AI CL MLMay 18, 2018

Improved Sentence Modeling using Suffix Bidirectional LSTM

arXiv:1805.07340v25.728 citations

Originality Incremental advance

AI Analysis

This addresses a known bottleneck in NLP models for tasks requiring sentence modeling, offering incremental improvements over existing methods.

The authors tackled the problem of sequential bias in Bidirectional LSTMs by proposing Suffix Bidirectional LSTM (SuBiLSTM), which encodes suffixes and prefixes to favor long-range dependencies, leading to new state-of-the-art results in tasks like fine-grained sentiment classification and question classification.

Recurrent neural networks have become ubiquitous in computing representations of sequential data, especially textual data in natural language processing. In particular, Bidirectional LSTMs are at the heart of several neural models achieving state-of-the-art performance in a wide variety of tasks in NLP. However, BiLSTMs are known to suffer from sequential bias - the contextual representation of a token is heavily influenced by tokens close to it in a sentence. We propose a general and effective improvement to the BiLSTM model which encodes each suffix and prefix of a sequence of tokens in both forward and reverse directions. We call our model Suffix Bidirectional LSTM or SuBiLSTM. This introduces an alternate bias that favors long range dependencies. We apply SuBiLSTMs to several tasks that require sentence modeling. We demonstrate that using SuBiLSTM instead of a BiLSTM in existing models leads to improvements in performance in learning general sentence representations, text classification, textual entailment and paraphrase detection. Using SuBiLSTM we achieve new state-of-the-art results for fine-grained sentiment classification and question classification.

View on arXiv PDF

Similar