ROMay 25

RePlan-Bot: Multi-Level Replanning for Embodied Instruction Following

Xicheng Gong, Guozheng Sun, Peiran Xu, Yadong Mu

arXiv:2605.2585169.3

Predicted impact top 26% in RO · last 90 daysOriginality Highly original

AI Analysis

For embodied AI agents, this work addresses long-horizon planning and irreversible state changes, improving task success rates in complex 3D environments.

RePlan-Bot introduces multi-level replanning for embodied instruction following, achieving state-of-the-art performance on the ALFRED benchmark with superior adaptability and reliability in both seen and unseen environments.

Embodied instruction following (EIF) requires agents to understand and execute complex natural language commands within interactive 3D environments. Despite recent advances, existing methods often fail in long-horizon planning and handling irreversible state changes, resulting in low task success rates. To address these challenges, we introduce RePlan-Bot, a novel EIF agent that performs multi-level, continuous replanning throughout task execution. RePlan-Bot integrates a high-level LLM-based auditor for dynamic sub-goal adjustments guided by environmental feedback, a commonsense-guided search mechanism based on a multi-layered instance map for precise and structured object localization, and a lightweight ViT-based corrector to preemptively fix risky low-level actions. Evaluated on the ALFRED benchmark, RePlan-Bot achieves state-of-the-art performance in both seen and unseen environments, demonstrating superior adaptability and reliability.

View on arXiv PDF

Similar