IRCLCVApr 7, 2020

Query-controllable Video Summarization

arXiv:2004.03661v156 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses the challenge of efficient video exploration for users with specific queries, though it is incremental as it builds on existing summarization techniques.

The authors tackled the problem of generating fixed video summaries that ignore user information needs by introducing a query-controllable video summarization method, which uses text-based queries to produce tailored summaries and shows improved model performance.

When video collections become huge, how to explore both within and across videos efficiently is challenging. Video summarization is one of the ways to tackle this issue. Traditional summarization approaches limit the effectiveness of video exploration because they only generate one fixed video summary for a given input video independent of the information need of the user. In this work, we introduce a method which takes a text-based query as input and generates a video summary corresponding to it. We do so by modeling video summarization as a supervised learning problem and propose an end-to-end deep learning based method for query-controllable video summarization to generate a query-dependent video summary. Our proposed method consists of a video summary controller, video summary generator, and video summary output module. To foster the research of query-controllable video summarization and conduct our experiments, we introduce a dataset that contains frame-based relevance score labels. Based on our experimental result, it shows that the text-based query helps control the video summary. It also shows the text-based query improves our model performance. Our code and dataset: https://github.com/Jhhuangkay/Query-controllable-Video-Summarization.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes