CLMar 6, 2022

A Multi-Document Coverage Reward for RELAXed Multi-Document Summarization

Jacob Parnell, Inigo Jauregi Unanue, Massimo Piccardi

arXiv:2203.02894v131.9637 citationsh-index: 33Has Code

Originality Incremental advance

AI Analysis

This work addresses the problem of improving summary quality and coverage for multi-document summarization, though it is incremental as it builds on existing RL and baseline methods.

The paper tackled the limitation of multi-document summarization models by fine-tuning a baseline with a reward balancing ROUGE and input document coverage, resulting in improvements of up to +0.95 pp average ROUGE and +3.17 pp METEOR scores over the baseline.

Multi-document summarization (MDS) has made significant progress in recent years, in part facilitated by the availability of new, dedicated datasets and capacious language models. However, a standing limitation of these models is that they are trained against limited references and with plain maximum-likelihood objectives. As for many other generative tasks, reinforcement learning (RL) offers the potential to improve the training of MDS models; yet, it requires a carefully-designed reward that can ensure appropriate leverage of both the reference summaries and the input documents. For this reason, in this paper we propose fine-tuning an MDS baseline with a reward that balances a reference-based metric such as ROUGE with coverage of the input documents. To implement the approach, we utilize RELAX (Grathwohl et al., 2018), a contemporary gradient estimator which is both low-variance and unbiased, and we fine-tune the baseline in a few-shot style for both stability and computational efficiency. Experimental results over the Multi-News and WCEP MDS datasets show significant improvements of up to +0.95 pp average ROUGE score and +3.17 pp METEOR score over the baseline, and competitive results with the literature. In addition, they show that the coverage of the input documents is increased, and evenly across all documents.

View on arXiv PDF Code

Similar