CVDec 6, 2020

MVHM: A Large-Scale Multi-View Hand Mesh Benchmark for Accurate 3D Hand Pose Estimation

arXiv:2012.03206v136 citationsHas Code
AI Analysis

This work provides a valuable, large-scale, and accurately annotated multi-view dataset for researchers working on 3D hand pose estimation, addressing the limitations of existing smaller and less accurate datasets.

This paper addresses the challenge of 3D hand pose estimation from single RGB images by introducing MVHM, a large-scale multi-view hand mesh dataset with accurate 3D hand mesh and joint labels. They developed a spin match algorithm and an efficient pipeline to generate this dataset, which, when used for training, significantly improved hand pose estimation performance, achieving an AUC_20-50 of 0.990 on the MHP dataset compared to the previous state-of-the-art of 0.939.

Estimating 3D hand poses from a single RGB image is challenging because depth ambiguity leads the problem ill-posed. Training hand pose estimators with 3D hand mesh annotations and multi-view images often results in significant performance gains. However, existing multi-view datasets are relatively small with hand joints annotated by off-the-shelf trackers or automated through model predictions, both of which may be inaccurate and can introduce biases. Collecting a large-scale multi-view 3D hand pose images with accurate mesh and joint annotations is valuable but strenuous. In this paper, we design a spin match algorithm that enables a rigid mesh model matching with any target mesh ground truth. Based on the match algorithm, we propose an efficient pipeline to generate a large-scale multi-view hand mesh (MVHM) dataset with accurate 3D hand mesh and joint labels. We further present a multi-view hand pose estimation approach to verify that training a hand pose estimator with our generated dataset greatly enhances the performance. Experimental results show that our approach achieves the performance of 0.990 in $\text{AUC}_{\text{20-50}}$ on the MHP dataset compared to the previous state-of-the-art of 0.939 on this dataset. Our datasset is public available. \footnote{\url{https://github.com/Kuzphi/MVHM}} Our datasset is available at~\href{https://github.com/Kuzphi/MVHM}{\color{blue}{https://github.com/Kuzphi/MVHM}}.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes