CVMay 1, 2016

Dominant Codewords Selection with Topic Model for Action Recognition

arXiv:1605.00324v12 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental improvement for human activity recognition in video analysis.

The paper tackles action recognition by using latent Dirichlet allocation to model human motion primitives and selecting dominant codewords, demonstrating results on four datasets.

In this paper, we propose a framework for recognizing human activities that uses only in-topic dominant codewords and a mixture of intertopic vectors. Latent Dirichlet allocation (LDA) is used to develop approximations of human motion primitives; these are mid-level representations, and they adaptively integrate dominant vectors when classifying human activities. In LDA topic modeling, action videos (documents) are represented by a bag-of-words (input from a dictionary), and these are based on improved dense trajectories. The output topics correspond to human motion primitives, such as finger moving or subtle leg motion. We eliminate the impurities, such as missed tracking or changing light conditions, in each motion primitive. The assembled vector of motion primitives is an improved representation of the action. We demonstrate our method on four different datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes