ROCLHCJun 10, 2023

AutoTAMP: Autoregressive Task and Motion Planning with LLMs as Translators and Checkers

arXiv:2306.06531v3177 citationsh-index: 84
Originality Incremental advance
AI Analysis

This addresses the challenge of robust human-robot interaction in complex environments with constraints, though it is incremental as it builds on existing TAMP and LLM translation approaches.

The paper tackles the problem of enabling robots to understand and execute complex, long-horizon tasks described in natural language by using LLMs to translate language into an intermediate representation for joint task-and-motion planning, resulting in significant improvements in task completion over existing LLM-based methods.

For effective human-robot interaction, robots need to understand, plan, and execute complex, long-horizon tasks described by natural language. Recent advances in large language models (LLMs) have shown promise for translating natural language into robot action sequences for complex tasks. However, existing approaches either translate the natural language directly into robot trajectories or factor the inference process by decomposing language into task sub-goals and relying on a motion planner to execute each sub-goal. When complex environmental and temporal constraints are involved, inference over planning tasks must be performed jointly with motion plans using traditional task-and-motion planning (TAMP) algorithms, making factorization into subgoals untenable. Rather than using LLMs to directly plan task sub-goals, we instead perform few-shot translation from natural language task descriptions to an intermediate task representation that can then be consumed by a TAMP algorithm to jointly solve the task and motion plan. To improve translation, we automatically detect and correct both syntactic and semantic errors via autoregressive re-prompting, resulting in significant improvements in task completion. We show that our approach outperforms several methods using LLMs as planners in complex task domains. See our project website https://yongchao98.github.io/MIT-REALM-AutoTAMP/ for prompts, videos, and code.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes