CLOct 4, 2022

Towards Improving Faithfulness in Abstractive Summarization

Xiuying Chen, Mingzhe Li, Xin Gao, Xiangliang Zhang

arXiv:2210.01877v16.737 citationsh-index: 58Has Code

Originality Incremental advance

AI Analysis

This addresses the issue of generating inaccurate summaries for users relying on automated summarization tools, but it is incremental as it builds on existing neural methods.

The paper tackles the problem of unfaithfulness in abstractive summarization by proposing a Faithfulness Enhanced Summarization model (FES) that uses question-answering to ensure understanding and a max-margin loss to prevent over-reliance on language models, resulting in significant outperformance over baselines on CNN/DM and XSum datasets with improved factual consistency.

Despite the success achieved in neural abstractive summarization based on pre-trained language models, one unresolved issue is that the generated summaries are not always faithful to the input document. There are two possible causes of the unfaithfulness problem: (1) the summarization model fails to understand or capture the gist of the input text, and (2) the model over-relies on the language model to generate fluent but inadequate words. In this work, we propose a Faithfulness Enhanced Summarization model (FES), which is designed for addressing these two problems and improving faithfulness in abstractive summarization. For the first problem, we propose to use question-answering (QA) to examine whether the encoder fully grasps the input document and can answer the questions on the key information in the input. The QA attention on the proper input words can also be used to stipulate how the decoder should attend to the source. For the second problem, we introduce a max-margin loss defined on the difference between the language and the summarization model, aiming to prevent the overconfidence of the language model. Extensive experiments on two benchmark summarization datasets, CNN/DM and XSum, demonstrate that our model significantly outperforms strong baselines. The evaluation of factual consistency also shows that our model generates more faithful summaries than baselines.

View on arXiv PDF Code

Similar