CVAILGDec 11, 2023

ManiPose: Manifold-Constrained Multi-Hypothesis 3D Human Pose Estimation

arXiv:2312.06386v28 citationsh-index: 36NIPS
Originality Highly original
AI Analysis

This work addresses pose consistency problems in 3D human pose estimation for applications like motion capture and robotics, offering a novel approach that improves reliability over existing methods.

The paper tackles the depth ambiguity and pose-topology consistency issues in monocular 3D human pose estimation by proposing ManiPose, a manifold-constrained multi-hypothesis model that generates multiple plausible 3D poses from 2D inputs, outperforming state-of-the-art models in pose consistency by a large margin while remaining competitive on MPJPE.

We propose ManiPose, a manifold-constrained multi-hypothesis model for human-pose 2D-to-3D lifting. We provide theoretical and empirical evidence that, due to the depth ambiguity inherent to monocular 3D human pose estimation, traditional regression models suffer from pose-topology consistency issues, which standard evaluation metrics (MPJPE, P-MPJPE and PCK) fail to assess. ManiPose addresses depth ambiguity by proposing multiple candidate 3D poses for each 2D input, each with its estimated plausibility. Unlike previous multi-hypothesis approaches, ManiPose forgoes generative models, greatly facilitating its training and usage. By constraining the outputs to lie on the human pose manifold, ManiPose guarantees the consistency of all hypothetical poses, in contrast to previous works. We showcase the performance of ManiPose on real-world datasets, where it outperforms state-of-the-art models in pose consistency by a large margin while being very competitive on the MPJPE metric.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes