CVJul 22, 2022

3D Interacting Hand Pose Estimation by Hand De-occlusion and Removal

arXiv:2207.11061v159 citationsh-index: 98Has Code
Originality Incremental advance
AI Analysis

It addresses hand-hand occlusion and ambiguity in interacting hand pose estimation, which is essential for understanding human actions, but is incremental as it builds on single-hand pose estimation systems.

The paper tackles 3D interacting hand pose estimation from a single RGB image by decomposing the task to estimate each hand separately, using a Hand De-occlusion and Removal framework and a new synthetic dataset, and reports significant outperformance over previous state-of-the-art methods.

Estimating 3D interacting hand pose from a single RGB image is essential for understanding human actions. Unlike most previous works that directly predict the 3D poses of two interacting hands simultaneously, we propose to decompose the challenging interacting hand pose estimation task and estimate the pose of each hand separately. In this way, it is straightforward to take advantage of the latest research progress on the single-hand pose estimation system. However, hand pose estimation in interacting scenarios is very challenging, due to (1) severe hand-hand occlusion and (2) ambiguity caused by the homogeneous appearance of hands. To tackle these two challenges, we propose a novel Hand De-occlusion and Removal (HDR) framework to perform hand de-occlusion and distractor removal. We also propose the first large-scale synthetic amodal hand dataset, termed Amodal InterHand Dataset (AIH), to facilitate model training and promote the development of the related research. Experiments show that the proposed method significantly outperforms previous state-of-the-art interacting hand pose estimation approaches. Codes and data are available at https://github.com/MengHao666/HDR.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes