A gaze driven fast-forward method for first-person videos
This addresses the challenge of efficiently navigating large volumes of first-person video data for users like life-loggers or researchers, though it appears incremental as it builds on existing attention models.
The paper tackles the problem of accessing relevant information in unedited First-Person Videos by creating an accelerated version that emphasizes important moments to the recorder, using a gaze and visual scene analysis attention model to achieve this.
The growing data sharing and life-logging cultures are driving an unprecedented increase in the amount of unedited First-Person Videos. In this paper, we address the problem of accessing relevant information in First-Person Videos by creating an accelerated version of the input video and emphasizing the important moments to the recorder. Our method is based on an attention model driven by gaze and visual scene analysis that provides a semantic score of each frame of the input video. We performed several experimental evaluations on publicly available First-Person Videos datasets. The results show that our methodology can fast-forward videos emphasizing moments when the recorder visually interact with scene components while not including monotonous clips.