CVMar 19, 2021

Probabilistic 3D Human Shape and Pose Estimation from Multiple Unconstrained Images in the Wild

arXiv:2103.10978v266 citations
AI Analysis

It addresses the problem of more accurate and robust 3D human modeling from casual images for applications like virtual try-on or motion analysis, though it is incremental by extending existing methods to multi-image groups.

The paper tackles 3D human shape and pose estimation from multiple unconstrained images, showing that multi-image inputs improve shape estimation metrics compared to single images on datasets like SSP-3D and a private tape-measured dataset, and it predicts pose uncertainty useful for occluded images while being competitive in pose metrics on 3DPW.

This paper addresses the problem of 3D human body shape and pose estimation from RGB images. Recent progress in this field has focused on single images, video or multi-view images as inputs. In contrast, we propose a new task: shape and pose estimation from a group of multiple images of a human subject, without constraints on subject pose, camera viewpoint or background conditions between images in the group. Our solution to this task predicts distributions over SMPL body shape and pose parameters conditioned on the input images in the group. We probabilistically combine predicted body shape distributions from each image to obtain a final multi-image shape prediction. We show that the additional body shape information present in multi-image input groups improves 3D human shape estimation metrics compared to single-image inputs on the SSP-3D dataset and a private dataset of tape-measured humans. In addition, predicting distributions over 3D bodies allows us to quantify pose prediction uncertainty, which is useful when faced with challenging input images with significant occlusion. Our method demonstrates meaningful pose uncertainty on the 3DPW dataset and is competitive with the state-of-the-art in terms of pose estimation metrics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes