MMCVJan 25, 2016

Egocentric Activity Recognition with Multimodal Fisher Vector

arXiv:1601.06603v145 citations
Originality Incremental advance
AI Analysis

This work addresses activity recognition for wearable device users by providing a new dataset and fusion method, though it is incremental in nature.

The paper tackled egocentric activity recognition by creating a multimodal dataset with 20 fine-grained activity categories and proposing a Fisher Kernel framework to fuse video and enhanced sensor features, achieving improved recognition results.

With the increasing availability of wearable devices, research on egocentric activity recognition has received much attention recently. In this paper, we build a Multimodal Egocentric Activity dataset which includes egocentric videos and sensor data of 20 fine-grained and diverse activity categories. We present a novel strategy to extract temporal trajectory-like features from sensor data. We propose to apply the Fisher Kernel framework to fuse video and temporal enhanced sensor features. Experiment results show that with careful design of feature extraction and fusion algorithm, sensor data can enhance information-rich video data. We make publicly available the Multimodal Egocentric Activity dataset to facilitate future research.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes