ROLGJul 18, 2024

The Art of Imitation: Learning Long-Horizon Manipulation Tasks from Few Demonstrations

arXiv:2407.13432v317 citationsh-index: 16
Originality Incremental advance
AI Analysis

This work addresses sample efficiency and generalization in robot manipulation for robotics researchers, though it is incremental as it builds on existing TP-GMM methods.

The paper tackled challenges in applying Task Parametrized Gaussian Mixture Models (TP-GMMs) for robot manipulation by proposing a method that factorizes end-effector velocities, segments skills, and detects task parameters from visual observations, achieving state-of-the-art performance with 20-fold improved sample efficiency from just five demonstrations.

Task Parametrized Gaussian Mixture Models (TP-GMM) are a sample-efficient method for learning object-centric robot manipulation tasks. However, there are several open challenges to applying TP-GMMs in the wild. In this work, we tackle three crucial challenges synergistically. First, end-effector velocities are non-Euclidean and thus hard to model using standard GMMs. We thus propose to factorize the robot's end-effector velocity into its direction and magnitude, and model them using Riemannian GMMs. Second, we leverage the factorized velocities to segment and sequence skills from complex demonstration trajectories. Through the segmentation, we further align skill trajectories and hence leverage time as a powerful inductive bias. Third, we present a method to automatically detect relevant task parameters per skill from visual observations. Our approach enables learning complex manipulation tasks from just five demonstrations while using only RGB-D observations. Extensive experimental evaluations on RLBench demonstrate that our approach achieves state-of-the-art performance with 20-fold improved sample efficiency. Our policies generalize across different environments, object instances, and object positions, while the learned skills are reusable.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes