CLMay 15, 2018

Simplifying Sentences with Sequence to Sequence Models

arXiv:1805.05557v12 citations
Originality Incremental advance
AI Analysis

This work addresses sentence simplification for improving text accessibility, but it is incremental as it builds on existing sequence-to-sequence methods.

The authors tackled the problem of sentence simplification using an attentive neural sequence-to-sequence model with a novel word-copy mechanism and joint embeddings, achieving an 8.8 point BLEU score improvement over a baseline on news article data.

We simplify sentences with an attentive neural network sequence to sequence model, dubbed S4. The model includes a novel word-copy mechanism and loss function to exploit linguistic similarities between the original and simplified sentences. It also jointly uses pre-trained and fine-tuned word embeddings to capture the semantics of complex sentences and to mitigate the effects of limited data. When trained and evaluated on pairs of sentences from thousands of news articles, we observe a 8.8 point improvement in BLEU score over a sequence to sequence baseline; however, learning word substitutions remains difficult. Such sequence to sequence models are promising for other text generation tasks such as style transfer.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes