CLAISep 2, 2015

A Neural Attention Model for Abstractive Sentence Summarization

arXiv:1509.00685v22837 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of generating concise summaries for natural language processing applications, representing an incremental improvement over existing methods.

The authors tackled the challenge of abstractive sentence summarization by proposing a fully data-driven neural attention model that generates summaries word by word, achieving significant performance gains on the DUC-2004 shared task compared to strong baselines.

Summarization based on text extraction is inherently limited, but generation-style abstractive methods have proven challenging to build. In this work, we propose a fully data-driven approach to abstractive sentence summarization. Our method utilizes a local attention-based model that generates each word of the summary conditioned on the input sentence. While the model is structurally simple, it can easily be trained end-to-end and scales to a large amount of training data. The model shows significant performance gains on the DUC-2004 shared task compared with several strong baselines.

Code Implementations4 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes