CLJun 4, 2019

HighRES: Highlight-based Reference-less Evaluation of Summarization

arXiv:1906.01361v11108 citations
Originality Incremental advance
AI Analysis

This addresses evaluation challenges in summarization research, offering a more reliable manual assessment method, though it is incremental as it builds on existing datasets and systems.

The paper tackles the inconsistency in manual evaluation of summarization systems by proposing HighRES, a highlight-based reference-less evaluation method that improves inter-annotator agreement and reveals system differences ignored by other approaches.

There has been substantial progress in summarization research enabled by the availability of novel, often large-scale, datasets and recent advances on neural network-based approaches. However, manual evaluation of the system generated summaries is inconsistent due to the difficulty the task poses to human non-expert readers. To address this issue, we propose a novel approach for manual evaluation, Highlight-based Reference-less Evaluation of Summarization (HighRES), in which summaries are assessed by multiple annotators against the source document via manually highlighted salient content in the latter. Thus summary assessment on the source document by human judges is facilitated, while the highlights can be used for evaluating multiple systems. To validate our approach we employ crowd-workers to augment with highlights a recently proposed dataset and compare two state-of-the-art systems. We demonstrate that HighRES improves inter-annotator agreement in comparison to using the source document directly, while they help emphasize differences among systems that would be ignored under other evaluation approaches.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes