ROAICLFLSep 28, 2024

SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language Models

arXiv:2409.19471v226 citationsh-index: 7Has Code
Originality Highly original
AI Analysis

This work addresses the problem of reliable and safe task planning for robot agents using large language models, representing a novel method for a known bottleneck in robotics.

The paper tackles the challenge of ensuring robot agents adhere to user-specified constraints when generating task plans from natural language commands, particularly for complex and long-horizon tasks. It introduces SELP, which combines equivalence voting, constrained decoding, and domain-specific fine-tuning to improve safety and efficiency, achieving up to 20.4% improvement in safety rates and 19.8% in plan efficiency compared to state-of-the-art planners.

Despite significant advancements in large language models (LLMs) that enhance robot agents' understanding and execution of natural language (NL) commands, ensuring the agents adhere to user-specified constraints remains challenging, particularly for complex commands and long-horizon tasks. To address this challenge, we present three key insights, equivalence voting, constrained decoding, and domain-specific fine-tuning, which significantly enhance LLM planners' capability in handling complex tasks. Equivalence voting ensures consistency by generating and sampling multiple Linear Temporal Logic (LTL) formulas from NL commands, grouping equivalent LTL formulas, and selecting the majority group of formulas as the final LTL formula. Constrained decoding then uses the generated LTL formula to enforce the autoregressive inference of plans, ensuring the generated plans conform to the LTL. Domain-specific fine-tuning customizes LLMs to produce safe and efficient plans within specific task domains. Our approach, Safe Efficient LLM Planner (SELP), combines these insights to create LLM planners to generate plans adhering to user commands with high confidence. We demonstrate the effectiveness and generalizability of SELP across different robot agents and tasks, including drone navigation and robot manipulation. For drone navigation tasks, SELP outperforms state-of-the-art planners by 10.8% in safety rate (i.e., finishing tasks conforming to NL commands) and by 19.8% in plan efficiency. For robot manipulation tasks, SELP achieves 20.4% improvement in safety rate. Our datasets for evaluating NL-to-LTL and robot task planning will be released in github.com/lt-asset/selp.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes