RO AI CLApr 30, 2025

LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics

Marc Glocker, Peter Hönig, Matthias Hirschmanner, Markus Vincze

arXiv:2504.21716v120.022 citationsh-index: 5Has Code

Originality Incremental advance

AI Analysis

This addresses the problem of long-term task planning and memory in household robotics for users, but it is incremental as it combines existing methods like LLMs, RAG, and vision models.

The paper tackles autonomous household object management by developing an embodied robotic system with an LLM-driven agent-orchestration architecture, achieving high task planning accuracy and improved memory recall through retrieval-augmented generation (RAG).

We present an embodied robotic system with an LLM-driven agent-orchestration architecture for autonomous household object management. The system integrates memory-augmented task planning, enabling robots to execute high-level user commands while tracking past actions. It employs three specialized agents: a routing agent, a task planning agent, and a knowledge base agent, each powered by task-specific LLMs. By leveraging in-context learning, our system avoids the need for explicit model training. RAG enables the system to retrieve context from past interactions, enhancing long-term object tracking. A combination of Grounded SAM and LLaMa3.2-Vision provides robust object detection, facilitating semantic scene understanding for task planning. Evaluation across three household scenarios demonstrates high task planning accuracy and an improvement in memory recall due to RAG. Specifically, Qwen2.5 yields best performance for specialized agents, while LLaMA3.1 excels in routing tasks. The source code is available at: https://github.com/marc1198/chat-hsr.

View on arXiv PDF Code

Similar