CVJan 3, 2019

Demystifying Multi-Faceted Video Summarization: Tradeoff Between Diversity,Representation, Coverage and Importance

arXiv:1901.01153v10.0024 citations
AI Analysis15

This work provides insights for practitioners to choose appropriate summarization models, but it is incremental as it focuses on explainability and application rather than introducing a new paradigm.

The paper tackles the problem of automatic multi-faceted video summarization by proposing a framework that investigates models based on diversity, coverage, representation, and importance, arguing for their utility across different applications.

This paper addresses automatic summarization of videos in a unified manner. In particular, we propose a framework for multi-faceted summarization for extractive, query base and entity summarization (summarization at the level of entities like objects, scenes, humans and faces in the video). We investigate several summarization models which capture notions of diversity, coverage, representation and importance, and argue the utility of these different models depending on the application. While most of the prior work on submodular summarization approaches has focused oncombining several models and learning weighted mixtures, we focus on the explainability of different models and featurizations, and how they apply to different domains. We also provide implementation details on summarization systems and the different modalities involved. We hope that the study from this paper will give insights into practitioners to appropriately choose the right summarization models for the problems at hand.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes