CVJul 22, 2018

Deep Discriminative Model for Video Classification

arXiv:1807.08259v15 citations
Originality Incremental advance
AI Analysis

This work addresses video classification for action recognition and dynamic scenes, showing incremental improvements over existing methods.

The paper tackles video-based scene classification by proposing a Heterogeneous Deep Discriminative Model (HDDM) with unsupervised pre-training and sparse spatiotemporal feature extraction, achieving superior performance on five datasets including UCF101 and Hollywood II.

This paper presents a new deep learning approach for video-based scene classification. We design a Heterogeneous Deep Discriminative Model (HDDM) whose parameters are initialized by performing an unsupervised pre-training in a layer-wise fashion using Gaussian Restricted Boltzmann Machines (GRBM). In order to avoid the redundancy of adjacent frames, we extract spatiotemporal variation patterns within frames and represent them sparsely using Sparse Cubic Symmetrical Pattern (SCSP). Then, a pre-initialized HDDM is separately trained using the videos of each class to learn class-specific models. According to the minimum reconstruction error from the learnt class-specific models, a weighted voting strategy is employed for the classification. The performance of the proposed method is extensively evaluated on two action recognition datasets; UCF101 and Hollywood II, and three dynamic texture and dynamic scene datasets; DynTex, YUPENN, and Maryland. The experimental results and comparisons against state-of-the-art methods demonstrate that the proposed method consistently achieves superior performance on all datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes