CVIRMay 5, 2015

Visual Summary of Egocentric Photostreams by Representative Keyframes

arXiv:1505.01130v242 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the need for memory reinforcement in lifelogging applications, but it appears incremental as it builds on existing keyframe selection and clustering techniques.

The paper tackles the problem of creating visual summaries from egocentric photostreams by selecting representative keyframes, using a CNN-based method with unsupervised clustering to identify events and extract keyframes, and evaluates the approach through a blind-taste test with 20 participants.

Building a visual summary from an egocentric photostream captured by a lifelogging wearable camera is of high interest for different applications (e.g. memory reinforcement). In this paper, we propose a new summarization method based on keyframes selection that uses visual features extracted by means of a convolutional neural network. Our method applies an unsupervised clustering for dividing the photostreams into events, and finally extracts the most relevant keyframe for each event. We assess the results by applying a blind-taste test on a group of 20 people who assessed the quality of the summaries.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes