CVMar 18, 2023

Grounding 3D Object Affordance from 2D Interactions in Images

arXiv:2303.10437v275 citationsh-index: 104Has Code
AI Analysis

It addresses a novel task for embodied agents by leveraging 2D interactions to improve generalization in 3D affordance perception, which is incremental as it builds on existing geometry-focused approaches.

This paper tackles the problem of grounding 3D object affordance from 2D interactions in images, aiming to locate action possibilities in 3D space for embodied agents, and demonstrates the reliability of the proposed task and the superiority of their method through comprehensive experiments on a new dataset.

Grounding 3D object affordance seeks to locate objects' ''action possibilities'' regions in the 3D space, which serves as a link between perception and operation for embodied agents. Existing studies primarily focus on connecting visual affordances with geometry structures, e.g. relying on annotations to declare interactive regions of interest on the object and establishing a mapping between the regions and affordances. However, the essence of learning object affordance is to understand how to use it, and the manner that detaches interactions is limited in generalization. Normally, humans possess the ability to perceive object affordances in the physical world through demonstration images or videos. Motivated by this, we introduce a novel task setting: grounding 3D object affordance from 2D interactions in images, which faces the challenge of anticipating affordance through interactions of different sources. To address this problem, we devise a novel Interaction-driven 3D Affordance Grounding Network (IAG), which aligns the region feature of objects from different sources and models the interactive contexts for 3D object affordance grounding. Besides, we collect a Point-Image Affordance Dataset (PIAD) to support the proposed task. Comprehensive experiments on PIAD demonstrate the reliability of the proposed task and the superiority of our method. The project is available at https://github.com/yyvhang/IAGNet.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes