CLJan 31, 2021

Contextualized Rewriting for Text Summarization

arXiv:2102.00385v217 citations
AI Analysis

This addresses the issue of losing important background knowledge in text summarization for users needing concise and readable summaries, though it appears incremental by building on existing abstractive rewriting approaches.

The paper tackles the problem of extractive summarization suffering from irrelevance, redundancy, and incoherence by proposing contextualized rewriting that uses the entire original document as input, achieving significant improvements in ROUGE scores over non-contextualized systems.

Extractive summarization suffers from irrelevance, redundancy and incoherence. Existing work shows that abstractive rewriting for extractive summaries can improve the conciseness and readability. These rewriting systems consider extracted summaries as the only input, which is relatively focused but can lose important background knowledge. In this paper, we investigate contextualized rewriting, which ingests the entire original document. We formalize contextualized rewriting as a seq2seq problem with group alignments, introducing group tag as a solution to model the alignments, identifying extracted summaries through content-based addressing. Results show that our approach significantly outperforms non-contextualized rewriting systems without requiring reinforcement learning, achieving strong improvements on ROUGE scores upon multiple extractive summarizers.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes