CLAILGMay 28, 2018

Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting

arXiv:1805.11080v11308 citations
Originality Highly original
AI Analysis

This addresses the problem of slow and inefficient abstractive summarization for NLP applications, representing a strong incremental improvement with practical speed gains.

The paper tackles abstractive document summarization by proposing a model that selects salient sentences and rewrites them abstractively, achieving new state-of-the-art results on CNN/Daily Mail with 10-20x faster inference and 4x faster training convergence.

Inspired by how humans summarize long documents, we propose an accurate and fast summarization model that first selects salient sentences and then rewrites them abstractively (i.e., compresses and paraphrases) to generate a concise overall summary. We use a novel sentence-level policy gradient method to bridge the non-differentiable computation between these two neural networks in a hierarchical way, while maintaining language fluency. Empirically, we achieve the new state-of-the-art on all metrics (including human evaluation) on the CNN/Daily Mail dataset, as well as significantly higher abstractiveness scores. Moreover, by first operating at the sentence-level and then the word-level, we enable parallel decoding of our neural generative model that results in substantially faster (10-20x) inference speed as well as 4x faster training convergence than previous long-paragraph encoder-decoder models. We also demonstrate the generalization of our model on the test-only DUC-2002 dataset, where we achieve higher scores than a state-of-the-art model.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes