CVApr 27, 2022

Collaborative Learning for Hand and Object Reconstruction with Attention-guided Graph Convolution

arXiv:2204.13062v188 citationsh-index: 65
Originality Incremental advance
AI Analysis

This addresses a limitation in augmented and virtual reality applications by enabling reconstruction without explicit physical constraints or object models, though it is incremental as it builds on existing deep learning approaches.

The paper tackles the problem of reconstructing hand and object poses and shapes during interaction without requiring known object models, achieving state-of-the-art accuracy in 3D pose estimation on four benchmarks.

Estimating the pose and shape of hands and objects under interaction finds numerous applications including augmented and virtual reality. Existing approaches for hand and object reconstruction require explicitly defined physical constraints and known objects, which limits its application domains. Our algorithm is agnostic to object models, and it learns the physical rules governing hand-object interaction. This requires automatically inferring the shapes and physical interaction of hands and (potentially unknown) objects. We seek to approach this challenging problem by proposing a collaborative learning strategy where two-branches of deep networks are learning from each other. Specifically, we transfer hand mesh information to the object branch and vice versa for the hand branch. The resulting optimisation (training) problem can be unstable, and we address this via two strategies: (i) attention-guided graph convolution which helps identify and focus on mutual occlusion and (ii) unsupervised associative loss which facilitates the transfer of information between the branches. Experiments using four widely-used benchmarks show that our framework achieves beyond state-of-the-art accuracy in 3D pose estimation, as well as recovers dense 3D hand and object shapes. Each technical component above contributes meaningfully in the ablation study.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes