CVJun 9, 2014

Log-Euclidean Bag of Words for Human Action Recognition

arXiv:1406.2139v30.0050 citations
AI Analysis50

This work addresses the problem of improving action recognition accuracy for video analysis, though it is incremental as it extends existing BoW approaches with Riemannian geometry.

The paper tackles human action recognition by proposing a Bag of Words model that accounts for the Riemannian geometry of covariance matrices, achieving notable improvements in discrimination accuracy compared to state-of-the-art methods.

Representing videos by densely extracted local space-time features has recently become a popular approach for analysing actions. In this paper, we tackle the problem of categorising human actions by devising Bag of Words (BoW) models based on covariance matrices of spatio-temporal features, with the features formed from histograms of optical flow. Since covariance matrices form a special type of Riemannian manifold, the space of Symmetric Positive Definite (SPD) matrices, non-Euclidean geometry should be taken into account while discriminating between covariance matrices. To this end, we propose to embed SPD manifolds to Euclidean spaces via a diffeomorphism and extend the BoW approach to its Riemannian version. The proposed BoW approach takes into account the manifold geometry of SPD matrices during the generation of the codebook and histograms. Experiments on challenging human action datasets show that the proposed method obtains notable improvements in discrimination accuracy, in comparison to several state-of-the-art methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes