ROAICVLGFeb 23, 2024

RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation

arXiv:2402.15487v272 citationsh-index: 30CoRL
Originality Incremental advance
AI Analysis

This addresses the challenge of robotic manipulation in complex environments by providing a structured representation, though it appears incremental as it builds on existing scene graph and LMM techniques.

The paper tackles the problem of enabling robots to autonomously explore environments and build an action-conditioned scene graph (ACSG) that captures both low-level and high-level information, resulting in a system that effectively facilitates real-world manipulation tasks for rigid, articulated, nested, and deformable objects.

We introduce the novel task of interactive scene exploration, wherein robots autonomously explore environments and produce an action-conditioned scene graph (ACSG) that captures the structure of the underlying environment. The ACSG accounts for both low-level information (geometry and semantics) and high-level information (action-conditioned relationships between different entities) in the scene. To this end, we present the Robotic Exploration (RoboEXP) system, which incorporates the Large Multimodal Model (LMM) and an explicit memory design to enhance our system's capabilities. The robot reasons about what and how to explore an object, accumulating new information through the interaction process and incrementally constructing the ACSG. Leveraging the constructed ACSG, we illustrate the effectiveness and efficiency of our RoboEXP system in facilitating a wide range of real-world manipulation tasks involving rigid, articulated objects, nested objects, and deformable objects.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes