CLDec 19, 2022

Unsupervised Summarization Re-ranking

arXiv:2212.09593v421.3223 citationsh-index: 62Has Code

Originality Incremental advance

AI Analysis

This work addresses the performance gap between unsupervised and supervised summarization models, offering incremental improvements for NLP researchers and practitioners.

The paper tackles the problem of high variance in quality among summary candidates from unsupervised summarization models like PEGASUS and ChatGPT, proposing an unsupervised re-ranking method that improves performance by up to 7.27% for PEGASUS and 6.86% for ChatGPT in mean ROUGE across benchmarks, with gains averaging 7.51% over 30 zero-shot transfer setups.

With the rise of task-specific pre-training objectives, abstractive summarization models like PEGASUS offer appealing zero-shot performance on downstream summarization tasks. However, the performance of such unsupervised models still lags significantly behind their supervised counterparts. Similarly to the supervised setup, we notice a very high variance in quality among summary candidates from these models while only one candidate is kept as the summary output. In this paper, we propose to re-rank summary candidates in an unsupervised manner, aiming to close the performance gap between unsupervised and supervised models. Our approach improves the unsupervised PEGASUS by up to 7.27% and ChatGPT by up to 6.86% relative mean ROUGE across four widely-adopted summarization benchmarks ; and achieves relative gains of 7.51% (up to 23.73% from XSum to WikiHow) averaged over 30 zero-shot transfer setups (finetuning on a dataset, evaluating on another).

View on arXiv PDF Code

Similar