Evaluating Interactive Summarization: an Expansion-Based Framework
This work addresses the lack of comparable evaluation methods for interactive summarization, providing a benchmark for future research in this domain.
The paper tackles the problem of evaluating interactive multi-document summarization by developing an expansion-based framework that accounts for accumulating information in user sessions, and it demonstrates its use by evaluating baseline implementations to support its viability.
Allowing users to interact with multi-document summarizers is a promising direction towards improving and customizing summary results. Different ideas for interactive summarization have been proposed in previous work but these solutions are highly divergent and incomparable. In this paper, we develop an end-to-end evaluation framework for expansion-based interactive summarization, which considers the accumulating information along an interactive session. Our framework includes a procedure of collecting real user sessions and evaluation measures relying on standards, but adapted to reflect interaction. All of our solutions are intended to be released publicly as a benchmark, allowing comparison of future developments in interactive summarization. We demonstrate the use of our framework by evaluating and comparing baseline implementations that we developed for this purpose, which will serve as part of our benchmark. Our extensive experimentation and analysis of these systems motivate our design choices and support the viability of our framework.