CLAIApr 28, 2015

Reader-Aware Multi-Document Summarization via Sparse Coding

arXiv:1504.07324v113 citations
Originality Incremental advance
AI Analysis

This addresses summarization for news readers by integrating user feedback, though it is incremental as it builds on existing sparse coding methods.

The paper tackles multi-document summarization by incorporating reader comments to improve salience and linguistic quality, achieving competitive results on new and classical datasets.

We propose a new MDS paradigm called reader-aware multi-document summarization (RA-MDS). Specifically, a set of reader comments associated with the news reports are also collected. The generated summaries from the reports for the event should be salient according to not only the reports but also the reader comments. To tackle this RA-MDS problem, we propose a sparse-coding-based method that is able to calculate the salience of the text units by jointly considering news reports and reader comments. Another reader-aware characteristic of our framework is to improve linguistic quality via entity rewriting. The rewriting consideration is jointly assessed together with other summarization requirements under a unified optimization model. To support the generation of compressive summaries via optimization, we explore a finer syntactic unit, namely, noun/verb phrase. In this work, we also generate a data set for conducting RA-MDS. Extensive experiments on this data set and some classical data sets demonstrate the effectiveness of our proposed approach.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes