CVJan 1, 2017

Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image

arXiv:1701.00295v4531 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of accurate 3D human pose estimation for computer vision applications, representing an incremental improvement over existing methods.

The paper tackles 3D human pose estimation from a single RGB image by jointly optimizing 2D joint estimation and 3D pose reconstruction, resulting in state-of-the-art performance on Human3.6M with improved 2D and 3D errors.

We propose a unified formulation for the problem of 3D human pose estimation from a single raw RGB image that reasons jointly about 2D joint estimation and 3D pose reconstruction to improve both tasks. We take an integrated approach that fuses probabilistic knowledge of 3D human pose with a multi-stage CNN architecture and uses the knowledge of plausible 3D landmark locations to refine the search for better 2D locations. The entire process is trained end-to-end, is extremely efficient and obtains state- of-the-art results on Human3.6M outperforming previous approaches both on 2D and 3D errors.

Code Implementations11 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes