CLDec 19, 2022

Unsupervised Summarization Re-ranking

arXiv:2212.09593v4223 citationsh-index: 62
Originality Incremental advance
AI Analysis

This work addresses the performance gap between unsupervised and supervised summarization models, offering incremental improvements for NLP researchers and practitioners.

The paper tackles the problem of high variance in quality among summary candidates from unsupervised summarization models like PEGASUS and ChatGPT, proposing an unsupervised re-ranking method that improves performance by up to 7.27% for PEGASUS and 6.86% for ChatGPT in mean ROUGE across benchmarks, with gains averaging 7.51% over 30 zero-shot transfer setups.

With the rise of task-specific pre-training objectives, abstractive summarization models like PEGASUS offer appealing zero-shot performance on downstream summarization tasks. However, the performance of such unsupervised models still lags significantly behind their supervised counterparts. Similarly to the supervised setup, we notice a very high variance in quality among summary candidates from these models while only one candidate is kept as the summary output. In this paper, we propose to re-rank summary candidates in an unsupervised manner, aiming to close the performance gap between unsupervised and supervised models. Our approach improves the unsupervised PEGASUS by up to 7.27% and ChatGPT by up to 6.86% relative mean ROUGE across four widely-adopted summarization benchmarks ; and achieves relative gains of 7.51% (up to 23.73% from XSum to WikiHow) averaged over 30 zero-shot transfer setups (finetuning on a dataset, evaluating on another).

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes