CLApr 4, 2019

Guiding Extractive Summarization with Question-Answering Rewards

arXiv:1904.02321v131.31119 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the problem of data scarcity for extractive summarization, offering a novel training approach that could benefit NLP researchers and practitioners, though it is incremental in leveraging existing QA methods.

The paper tackles the lack of ground-truth data for supervised extractive summarization by using question-answering rewards derived from human abstracts to guide the system, resulting in summaries that perform competitively with strong baselines in automatic and human evaluations.

Highlighting while reading is a natural behavior for people to track salient content of a document. It would be desirable to teach an extractive summarizer to do the same. However, a major obstacle to the development of a supervised summarizer is the lack of ground-truth. Manual annotation of extraction units is cost-prohibitive, whereas acquiring labels by automatically aligning human abstracts and source documents can yield inferior results. In this paper we describe a novel framework to guide a supervised, extractive summarization system with question-answering rewards. We argue that quality summaries should serve as a document surrogate to answer important questions, and such question-answer pairs can be conveniently obtained from human abstracts. The system learns to promote summaries that are informative, fluent, and perform competitively on question-answering. Our results compare favorably with those reported by strong summarization baselines as evaluated by automatic metrics and human assessors.

View on arXiv PDF Code

Similar