CLAIOct 21, 2021

Topic-Guided Abstractive Multi-Document Summarization

arXiv:2110.11207v1666 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of generating coherent summaries from multiple documents for applications like news aggregation, though it is incremental as it builds on existing graph-to-sequence and topic modeling methods.

The paper tackles multi-document summarization by proposing a model that uses a heterogeneous graph and neural topic model to guide summary generation, achieving state-of-the-art performance on Rouge metrics and human evaluation on the Multi-News dataset.

A critical point of multi-document summarization (MDS) is to learn the relations among various documents. In this paper, we propose a novel abstractive MDS model, in which we represent multiple documents as a heterogeneous graph, taking semantic nodes of different granularities into account, and then apply a graph-to-sequence framework to generate summaries. Moreover, we employ a neural topic model to jointly discover latent topics that can act as cross-document semantic units to bridge different documents and provide global information to guide the summary generation. Since topic extraction can be viewed as a special type of summarization that "summarizes" texts into a more abstract format, i.e., a topic distribution, we adopt a multi-task learning strategy to jointly train the topic and summarization module, allowing the promotion of each other. Experimental results on the Multi-News dataset demonstrate that our model outperforms previous state-of-the-art MDS models on both Rouge metrics and human evaluation, meanwhile learns high-quality topics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes