ROAILGOct 25, 2024

MILES: Making Imitation Learning Easy with Self-Supervision

arXiv:2410.19693v112 citationsh-index: 9CoRL
Originality Incremental advance
AI Analysis

This addresses the need for reduced human supervision in imitation learning for robotics, offering a more efficient approach, though it appears incremental as it builds on existing imitation learning methods.

The paper tackles the problem of laborious data collection in imitation learning by proposing MILES, a fully autonomous, self-supervised paradigm that enables efficient policy learning from just a single demonstration and a single environment reset, significantly outperforming state-of-the-art alternatives in real-world tasks like lock manipulation.

Data collection in imitation learning often requires significant, laborious human supervision, such as numerous demonstrations, and/or frequent environment resets for methods that incorporate reinforcement learning. In this work, we propose an alternative approach, MILES: a fully autonomous, self-supervised data collection paradigm, and we show that this enables efficient policy learning from just a single demonstration and a single environment reset. MILES autonomously learns a policy for returning to and then following the single demonstration, whilst being self-guided during data collection, eliminating the need for additional human interventions. We evaluated MILES across several real-world tasks, including tasks that require precise contact-rich manipulation such as locking a lock with a key. We found that, under the constraints of a single demonstration and no repeated environment resetting, MILES significantly outperforms state-of-the-art alternatives like imitation learning methods that leverage reinforcement learning. Videos of our experiments and code can be found on our webpage: www.robot-learning.uk/miles.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes