ROAIJun 9, 2025

Language-Grounded Hierarchical Planning and Execution with Multi-Robot 3D Scene Graphs

arXiv:2506.07454v25 citationsh-index: 20
Originality Incremental advance
AI Analysis

This addresses the challenge of coordinating robot teams for complex tasks in real-world settings, representing an incremental advancement in multi-robot systems.

The paper tackles the problem of enabling multi-robot systems to execute complex natural language instructions in large-scale outdoor environments by integrating mapping, localization, and task and motion planning using 3D scene graphs, resulting in a system that supports real-time relocalization and planning for task execution.

In this paper, we introduce a multi-robot system that integrates mapping, localization, and task and motion planning (TAMP) enabled by 3D scene graphs to execute complex instructions expressed in natural language. Our system builds a shared 3D scene graph incorporating an open-set object-based map, which is leveraged for multi-robot 3D scene graph fusion. This representation supports real-time, view-invariant relocalization (via the object-based map) and planning (via the 3D scene graph), allowing a team of robots to reason about their surroundings and execute complex tasks. Additionally, we introduce a planning approach that translates operator intent into Planning Domain Definition Language (PDDL) goals using a Large Language Model (LLM) by leveraging context from the shared 3D scene graph and robot capabilities. We provide an experimental assessment of the performance of our system on real-world tasks in large-scale, outdoor environments. A supplementary video is available at https://youtu.be/8xbGGOLfLAY.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes