CVNov 9, 2017

Making a long story short: A Multi-Importance fast-forwarding egocentric videos with the emphasis on relevant objects

Michel Melo Silva, Washington Luis Souza Ramos, Felipe Cadar Chamone, João Pedro Klock Ferreira, Mario Fernando Montenegro Campos, Erickson Rangel Nascimento

arXiv:1711.03473v35.010 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the tediousness of watching long unedited first-person videos for users, though it is incremental as it builds on existing semantic fast-forwarding methods.

The paper tackles the problem of fast-forwarding egocentric videos by balancing smooth visual flow with emphasis on relevant objects, resulting in a method that retains over 3 times more semantic content than state-of-the-art approaches.

The emergence of low-cost high-quality personal wearable cameras combined with the increasing storage capacity of video-sharing websites have evoked a growing interest in first-person videos, since most videos are composed of long-running unedited streams which are usually tedious and unpleasant to watch. State-of-the-art semantic fast-forward methods currently face the challenge of providing an adequate balance between smoothness in visual flow and the emphasis on the relevant parts. In this work, we present the Multi-Importance Fast-Forward (MIFF), a fully automatic methodology to fast-forward egocentric videos facing these challenges. The dilemma of defining what is the semantic information of a video is addressed by a learning process based on the preferences of the user. Results show that the proposed method keeps over $3$ times more semantic content than the state-of-the-art fast-forward. Finally, we discuss the need of a particular video stabilization technique for fast-forward egocentric videos.

View on arXiv PDF Code

Similar