CLDec 2, 2021

CO2Sum:Contrastive Learning for Factual-Consistent Abstractive Summarization

arXiv:2112.01147v220 citations
AI Analysis

This addresses the challenge of factual consistency in summarization for NLP applications, representing an incremental improvement over existing methods.

The paper tackles the problem of generating factual-consistent summaries in abstractive summarization by proposing CO2Sum, a contrastive learning scheme applied to sequence-to-sequence models, which improves faithfulness on benchmarks and achieves competitive results compared to baselines.

Generating factual-consistent summaries is a challenging task for abstractive summarization. Previous works mainly encode factual information or perform post-correct/rank after decoding. In this paper, we provide a factual-consistent solution from the perspective of contrastive learning, which is a natural extension of previous works. We propose CO2Sum (Contrastive for Consistency), a contrastive learning scheme that can be easily applied on sequence-to-sequence models for factual-consistent abstractive summarization, proving that the model can be fact-aware without modifying the architecture. CO2Sum applies contrastive learning on the encoder, which can help the model be aware of the factual information contained in the input article, or performs contrastive learning on the decoder, which makes the model to generate factual-correct output summary. What's more, these two schemes are orthogonal and can be combined to further improve faithfulness. Comprehensive experiments on public benchmarks demonstrate that CO2Sum improves the faithfulness on large pre-trained language models and reaches competitive results compared to other strong factual-consistent summarization baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes