CL AIOct 21, 2021

Topic-Guided Abstractive Multi-Document Summarization

arXiv:2110.11207v130.9666 citations

Originality Incremental advance

AI Analysis

This work addresses the problem of generating coherent summaries from multiple documents for applications like news aggregation, though it is incremental as it builds on existing graph-to-sequence and topic modeling methods.

The paper tackles multi-document summarization by proposing a model that uses a heterogeneous graph and neural topic model to guide summary generation, achieving state-of-the-art performance on Rouge metrics and human evaluation on the Multi-News dataset.

A critical point of multi-document summarization (MDS) is to learn the relations among various documents. In this paper, we propose a novel abstractive MDS model, in which we represent multiple documents as a heterogeneous graph, taking semantic nodes of different granularities into account, and then apply a graph-to-sequence framework to generate summaries. Moreover, we employ a neural topic model to jointly discover latent topics that can act as cross-document semantic units to bridge different documents and provide global information to guide the summary generation. Since topic extraction can be viewed as a special type of summarization that "summarizes" texts into a more abstract format, i.e., a topic distribution, we adopt a multi-task learning strategy to jointly train the topic and summarization module, allowing the promotion of each other. Experimental results on the Multi-News dataset demonstrate that our model outperforms previous state-of-the-art MDS models on both Rouge metrics and human evaluation, meanwhile learns high-quality topics.

View on arXiv PDF

Similar