CVJul 30, 2018

Human Motion Analysis with Deep Metric Learning

arXiv:1807.11176v254 citations
Originality Incremental advance
AI Analysis

This work addresses the limitation of traditional metrics in capturing semantic relationships for human motion analysis, which is incremental as it builds on deep metric learning with specific adaptations.

The paper tackled the problem of measuring similarity between human motions for tasks like gait analysis and action retrieval by proposing a triplet-based deep metric learning approach with a novel objective and attentive recurrent neural network architecture, achieving significant improvements over conventional metrics on two datasets.

Effectively measuring the similarity between two human motions is necessary for several computer vision tasks such as gait analysis, person identi- fication and action retrieval. Nevertheless, we believe that traditional approaches such as L2 distance or Dynamic Time Warping based on hand-crafted local pose metrics fail to appropriately capture the semantic relationship across motions and, as such, are not suitable for being employed as metrics within these tasks. This work addresses this limitation by means of a triplet-based deep metric learning specifically tailored to deal with human motion data, in particular with the prob- lem of varying input size and computationally expensive hard negative mining due to motion pair alignment. Specifically, we propose (1) a novel metric learn- ing objective based on a triplet architecture and Maximum Mean Discrepancy; as well as, (2) a novel deep architecture based on attentive recurrent neural networks. One benefit of our objective function is that it enforces a better separation within the learned embedding space of the different motion categories by means of the associated distribution moments. At the same time, our attentive recurrent neural network allows processing varying input sizes to a fixed size of embedding while learning to focus on those motion parts that are semantically distinctive. Our ex- periments on two different datasets demonstrate significant improvements over conventional human motion metrics.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes