CVAug 17, 2018

Neural Body Fitting: Unifying Deep Learning and Model-Based Human Pose and Shape Estimation

arXiv:1808.05942v1525 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses the problem of 3D human pose estimation for computer vision applications, presenting an incremental improvement by combining deep learning with model-based methods.

The paper tackles the challenge of directly predicting 3D human pose and shape from 2D images by proposing Neural Body Fitting (NBF), which integrates a statistical body model with a CNN using part segmentations and model constraints, achieving competitive results on standard benchmarks.

Direct prediction of 3D body pose and shape remains a challenge even for highly parameterized deep learning models. Mapping from the 2D image space to the prediction space is difficult: perspective ambiguities make the loss function noisy and training data is scarce. In this paper, we propose a novel approach (Neural Body Fitting (NBF)). It integrates a statistical body model within a CNN, leveraging reliable bottom-up semantic body part segmentation and robust top-down body model constraints. NBF is fully differentiable and can be trained using 2D and 3D annotations. In detailed experiments, we analyze how the components of our model affect performance, especially the use of part segmentations as an explicit intermediate representation, and present a robust, efficiently trainable framework for 3D human pose estimation from 2D images with competitive results on standard benchmarks. Code will be made available at http://github.com/mohomran/neural_body_fitting

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes