CLSep 28, 2020

Reducing Quantity Hallucinations in Abstractive Summarization

Zheng Zhao, Shay B. Cohen, Bonnie Webber

arXiv:2009.13312v131.41015 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the issue of factual inaccuracies in summaries for users relying on automated summarization tools, though it is incremental as it builds on existing verification approaches.

The paper tackles the problem of quantity hallucinations in abstractive summarization by developing a system that verifies quantity entities in summaries against the original text, resulting in higher ROUGE precision and F1 scores without significant recall loss, with human evaluators preferring the up-ranked summaries.

It is well-known that abstractive summaries are subject to hallucination---including material that is not supported by the original text. While summaries can be made hallucination-free by limiting them to general phrases, such summaries would fail to be very informative. Alternatively, one can try to avoid hallucinations by verifying that any specific entities in the summary appear in the original text in a similar context. This is the approach taken by our system, Herman. The system learns to recognize and verify quantity entities (dates, numbers, sums of money, etc.) in a beam-worth of abstractive summaries produced by state-of-the-art models, in order to up-rank those summaries whose quantity terms are supported by the original text. Experimental results demonstrate that the ROUGE scores of such up-ranked summaries have a higher Precision than summaries that have not been up-ranked, without a comparable loss in Recall, resulting in higher F$_1$. Preliminary human evaluation of up-ranked vs. original summaries shows people's preference for the former.

View on arXiv PDF Code

Similar