CV LG NENov 20, 2016

Fast Video Classification via Adaptive Cascading of Deep Models

Haichen Shen, Seungyeop Han, Matthai Philipose, Arvind Krishnamurthy

arXiv:1611.06453v214.582 citationsh-index: 89

Originality Incremental advance

AI Analysis

This work addresses the computational cost of video classification for applications like media analysis, though it is incremental as it builds on existing classifier methods.

The paper tackled the problem of costly video classification by exploiting short-term skewed class distributions, achieving end-to-end speedups of 2.4-7.8x on GPU and 2.6-11.2x on CPU compared to a state-of-the-art CNN while maintaining competitive accuracy.

Recent advances have enabled "oracle" classifiers that can classify across many classes and input distributions with high accuracy without retraining. However, these classifiers are relatively heavyweight, so that applying them to classify video is costly. We show that day-to-day video exhibits highly skewed class distributions over the short term, and that these distributions can be classified by much simpler models. We formulate the problem of detecting the short-term skews online and exploiting models based on it as a new sequential decision making problem dubbed the Online Bandit Problem, and present a new algorithm to solve it. When applied to recognizing faces in TV shows and movies, we realize end-to-end classification speedups of 2.4-7.8x/2.6-11.2x (on GPU/CPU) relative to a state-of-the-art convolutional neural network, at competitive accuracy.

View on arXiv PDF

Similar