CVDec 2, 2016

Learning to Search on Manifolds for 3D Pose Estimation of Articulated Objects

arXiv:1612.00596v19 citations
Originality Incremental advance
AI Analysis

It addresses pose estimation for diverse articulated objects like hands and animals, offering incremental improvements in speed and accuracy.

The paper tackles 3D pose estimation of articulated objects from single depth images by formulating it as a sequential search on manifolds, achieving competitive performance and real-time operation at over 30 FPS.

This paper focuses on the challenging problem of 3D pose estimation of a diverse spectrum of articulated objects from single depth images. A novel structured prediction approach is considered, where 3D poses are represented as skeletal models that naturally operate on manifolds. Given an input depth image, the problem of predicting the most proper articulation of underlying skeletal model is thus formulated as sequentially searching for the optimal skeletal configuration. This is subsequently addressed by convolutional neural nets trained end-to-end to render sequential prediction of the joint locations as regressing a set of tangent vectors of the underlying manifolds. Our approach is examined on various articulated objects including human hand, mouse, and fish benchmark datasets. Empirically it is shown to deliver highly competitive performance with respect to the state-of-the-arts, while operating in real-time (over 30 FPS).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes