CVSep 14, 2023

HandNeRF: Learning to Reconstruct Hand-Object Interaction Scene from a Single RGB Image

arXiv:2309.07891v510 citationsh-index: 41
Originality Incremental advance
AI Analysis

This work addresses a domain-specific problem for robotics and computer vision applications, such as grasping and manipulation, by providing a method to infer 3D scenes from single images, though it is incremental as it builds on prior neural radiance field techniques.

The paper tackles the problem of reconstructing 3D hand-object interaction scenes from a single RGB image, addressing challenges like depth ambiguity and occlusions, and shows that HandNeRF reconstructs novel grasp configurations more accurately than comparable methods on real-world datasets.

This paper presents a method to learn hand-object interaction prior for reconstructing a 3D hand-object scene from a single RGB image. The inference as well as training-data generation for 3D hand-object scene reconstruction is challenging due to the depth ambiguity of a single image and occlusions by the hand and object. We turn this challenge into an opportunity by utilizing the hand shape to constrain the possible relative configuration of the hand and object geometry. We design a generalizable implicit function, HandNeRF, that explicitly encodes the correlation of the 3D hand shape features and 2D object features to predict the hand and object scene geometry. With experiments on real-world datasets, we show that HandNeRF is able to reconstruct hand-object scenes of novel grasp configurations more accurately than comparable methods. Moreover, we demonstrate that object reconstruction from HandNeRF ensures more accurate execution of downstream tasks, such as grasping and motion planning for robotic hand-over and manipulation. Homepage: https://samsunglabs.github.io/HandNeRF-project-page/

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes