CVMar 29, 2022

OakInk: A Large-scale Knowledge Repository for Understanding Hand-Object Interaction

arXiv:2203.15709v1164 citationsh-index: 20Has Code
Originality Incremental advance
AI Analysis

This provides a foundational dataset for researchers in robotics and computer vision to study human-object manipulation, though it is incremental as it builds on existing knowledge bases.

The authors tackled the lack of comprehensive knowledge for understanding hand-object interactions by creating OakInk, a large-scale multi-modal repository with 50,000 affordance-aware interactions, and demonstrated its utility in tasks like pose estimation and grasp generation.

Learning how humans manipulate objects requires machines to acquire knowledge from two perspectives: one for understanding object affordances and the other for learning human's interactions based on the affordances. Even though these two knowledge bases are crucial, we find that current databases lack a comprehensive awareness of them. In this work, we propose a multi-modal and rich-annotated knowledge repository, OakInk, for visual and cognitive understanding of hand-object interactions. We start to collect 1,800 common household objects and annotate their affordances to construct the first knowledge base: Oak. Given the affordance, we record rich human interactions with 100 selected objects in Oak. Finally, we transfer the interactions on the 100 recorded objects to their virtual counterparts through a novel method: Tink. The recorded and transferred hand-object interactions constitute the second knowledge base: Ink. As a result, OakInk contains 50,000 distinct affordance-aware and intent-oriented hand-object interactions. We benchmark OakInk on pose estimation and grasp generation tasks. Moreover, we propose two practical applications of OakInk: intent-based interaction generation and handover generation. Our datasets and source code are publicly available at https://github.com/lixiny/OakInk.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes