CLNov 29, 2023

Supervising the Centroid Baseline for Extractive Multi-Document Summarization

arXiv:2311.17771v1132 citationsh-index: 2
Originality Synthesis-oriented
AI Analysis

This work addresses the need for better summarization tools in multi-document and multilingual contexts, but it appears incremental as it builds upon existing centroid-based methods.

The paper tackled the problem of extractive multi-document summarization by refining the centroid method with beam search and an attention model, resulting in improved performance across multiple datasets, including in multilingual scenarios, though specific numerical gains are not detailed in the abstract.

The centroid method is a simple approach for extractive multi-document summarization and many improvements to its pipeline have been proposed. We further refine it by adding a beam search process to the sentence selection and also a centroid estimation attention model that leads to improved results. We demonstrate this in several multi-document summarization datasets, including in a multilingual scenario.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes