CVJan 26, 2019

Real-time Video Summarization on Commodity Hardware

arXiv:1901.09287v12 citations
Originality Incremental advance
AI Analysis

This addresses the problem of efficient video summarization for users needing quick processing on standard hardware, though it is incremental as it builds on existing datasets and methods.

The paper tackles real-time video summarization on commodity hardware by using low-level features and tree-based models to rank segments, achieving comparable accuracy to a state-of-the-art deep learning method with significantly faster run-times, generating summaries in less time than the video duration.

We present a method for creating video summaries in real-time on commodity hardware. Real-time here refers to the fact that the time required for video summarization is less than the duration of the input video. First, low-level features are use to discard undesirable frames. Next, video is divided into segments, and segment-level features are extracted for each segment. Tree-based models trained on widely available video summarization and computational aesthetics datasets are then used to rank individual segments, and top-ranked segments are selected to generate the final video summary. We evaluate the proposed method on SUMME dataset and show that our method is able to achieve summarization accuracy that is comparable to that of a current state-of-the-art deep learning method, while posting significantly faster run-times. Our method on average is able to generate a video summary in time that is shorter than the duration of the video.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes