CL AISep 2, 2015

A Neural Attention Model for Abstractive Sentence Summarization

Alexander M. Rush, Sumit Chopra, Jason Weston

arXiv:1509.00685v251.32840 citations

Originality Incremental advance

AI Analysis

This work addresses the problem of generating concise summaries for natural language processing applications, representing an incremental improvement over existing methods.

The authors tackled the challenge of abstractive sentence summarization by proposing a fully data-driven neural attention model that generates summaries word by word, achieving significant performance gains on the DUC-2004 shared task compared to strong baselines.

Summarization based on text extraction is inherently limited, but generation-style abstractive methods have proven challenging to build. In this work, we propose a fully data-driven approach to abstractive sentence summarization. Our method utilizes a local attention-based model that generates each word of the summary conditioned on the input sentence. While the model is structurally simple, it can easily be trained end-to-end and scales to a large amount of training data. The model shows significant performance gains on the DUC-2004 shared task compared with several strong baselines.

View on arXiv PDF

Similar