ROCVLGNov 22, 2024

WildLMa: Long Horizon Loco-Manipulation in the Wild

arXiv:2411.15131v216 citationsh-index: 16ICRA
Originality Incremental advance
AI Analysis

This work addresses the challenge of deploying robots for complex mobile manipulation in varied settings, though it appears incremental by building on existing imitation learning and LLM planning methods.

The paper tackles the problem of enabling quadruped robots with manipulators to perform long-horizon loco-manipulation tasks in diverse real-world environments, achieving a higher grasping success rate over existing RL baselines using only tens of demonstrations and generalizing to unseen objects.

'In-the-wild' mobile manipulation aims to deploy robots in diverse real-world environments, which requires the robot to (1) have skills that generalize across object configurations; (2) be capable of long-horizon task execution in diverse environments; and (3) perform complex manipulation beyond pick-and-place. Quadruped robots with manipulators hold promise for extending the workspace and enabling robust locomotion, but existing results do not investigate such a capability. This paper proposes WildLMa with three components to address these issues: (1) adaptation of learned low-level controller for VR-enabled whole-body teleoperation and traversability; (2) WildLMa-Skill -- a library of generalizable visuomotor skills acquired via imitation learning or heuristics and (3) WildLMa-Planner -- an interface of learned skills that allow LLM planners to coordinate skills for long-horizon tasks. We demonstrate the importance of high-quality training data by achieving higher grasping success rate over existing RL baselines using only tens of demonstrations. WildLMa exploits CLIP for language-conditioned imitation learning that empirically generalizes to objects unseen in training demonstrations. Besides extensive quantitative evaluation, we qualitatively demonstrate practical robot applications, such as cleaning up trash in university hallways or outdoor terrains, operating articulated objects, and rearranging items on a bookshelf.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes