CVLGNENov 20, 2016

Fast Video Classification via Adaptive Cascading of Deep Models

arXiv:1611.06453v282 citations
Originality Incremental advance
AI Analysis

This work addresses the computational cost of video classification for applications like media analysis, though it is incremental as it builds on existing classifier methods.

The paper tackled the problem of costly video classification by exploiting short-term skewed class distributions, achieving end-to-end speedups of 2.4-7.8x on GPU and 2.6-11.2x on CPU compared to a state-of-the-art CNN while maintaining competitive accuracy.

Recent advances have enabled "oracle" classifiers that can classify across many classes and input distributions with high accuracy without retraining. However, these classifiers are relatively heavyweight, so that applying them to classify video is costly. We show that day-to-day video exhibits highly skewed class distributions over the short term, and that these distributions can be classified by much simpler models. We formulate the problem of detecting the short-term skews online and exploiting models based on it as a new sequential decision making problem dubbed the Online Bandit Problem, and present a new algorithm to solve it. When applied to recognizing faces in TV shows and movies, we realize end-to-end classification speedups of 2.4-7.8x/2.6-11.2x (on GPU/CPU) relative to a state-of-the-art convolutional neural network, at competitive accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes