CLOct 15, 2021

Modeling Endorsement for Multi-Document Abstractive Summarization

Logan Lebanoff, Bingqing Wang, Zhe Feng, Fei Liu

arXiv:2110.07844v130.7662 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the challenge of efficiently summarizing multiple documents for applications like news aggregation, though it is incremental in improving existing neural models.

The paper tackled the problem of identifying salient content in multi-document summarization by modeling cross-document endorsement effects, resulting in a method that outperforms strong baselines on benchmark datasets.

A crucial difference between single- and multi-document summarization is how salient content manifests itself in the document(s). While such content may appear at the beginning of a single document, essential information is frequently reiterated in a set of documents related to a particular topic, resulting in an endorsement effect that increases information salience. In this paper, we model the cross-document endorsement effect and its utilization in multiple document summarization. Our method generates a synopsis from each document, which serves as an endorser to identify salient content from other documents. Strongly endorsed text segments are used to enrich a neural encoder-decoder model to consolidate them into an abstractive summary. The method has a great potential to learn from fewer examples to identify salient content, which alleviates the need for costly retraining when the set of documents is dynamically adjusted. Through extensive experiments on benchmark multi-document summarization datasets, we demonstrate the effectiveness of our proposed method over strong published baselines. Finally, we shed light on future research directions and discuss broader challenges of this task using a case study.

View on arXiv PDF Code

Similar