ROAISep 24, 2024

Long-horizon Embodied Planning with Implicit Logical Inference and Hallucination Mitigation

arXiv:2409.15658v22 citationsh-index: 6
Originality Highly original
AI Analysis

This addresses the challenge of logical errors and hallucinations in embodied AI planning for robots, offering a practical solution for diverse long-horizon tasks.

The paper tackles the problem of long-horizon embodied planning in AI by introducing ReLEP, a framework that decomposes abstract instructions into actionable steps without needing in-context examples, achieving high success rates and outperforming state-of-the-art methods on unseen tasks.

Long-horizon embodied planning underpins embodied AI. To accomplish long-horizon tasks, one of the most feasible ways is to decompose abstract instructions into a sequence of actionable steps. Foundation models still face logical errors and hallucinations in long-horizon planning, unless provided with highly relevant examples to the tasks. However, providing highly relevant examples for any random task is unpractical. Therefore, we present ReLEP, a novel framework for Real-time Long-horizon Embodied Planning. ReLEP can complete a wide range of long-horizon tasks without in-context examples by learning implicit logical inference through fine-tuning. The fine-tuned large vision-language model formulates plans as sequences of skill functions. These functions are selected from a carefully designed skill library. ReLEP is also equipped with a Memory module for plan and status recall, and a Robot Configuration module for versatility across robot types. In addition, we propose a data generation pipeline to tackle dataset scarcity. When constructing the dataset, we considered the implicit logical relationships, enabling the model to learn implicit logical relationships and dispel hallucinations. Through comprehensive evaluations across various long-horizon tasks, ReLEP demonstrates high success rates and compliance to execution even on unseen tasks and outperforms state-of-the-art baseline methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes