LG RODec 14, 2022

Cross-Domain Transfer via Semantic Skill Imitation

Karl Pertsch, Ruta Desai, Vikash Kumar, Franziska Meier, Joseph J. Lim, Dhruv Batra, Akshara Rai

arXiv:2212.07407v116.127 citationsh-index: 85

Originality Highly original

AI Analysis

This enables scaling robot learning by reusing demonstrations across domains, addressing a bottleneck in data collection for robotics.

The paper tackles the problem of accelerating reinforcement learning in a target domain by using demonstrations from a different source domain, such as human videos for robotic tasks, achieving performance comparable to methods requiring in-domain demonstrations with less than 3 minutes of video data.

We propose an approach for semantic imitation, which uses demonstrations from a source domain, e.g. human videos, to accelerate reinforcement learning (RL) in a different target domain, e.g. a robotic manipulator in a simulated kitchen. Instead of imitating low-level actions like joint velocities, our approach imitates the sequence of demonstrated semantic skills like "opening the microwave" or "turning on the stove". This allows us to transfer demonstrations across environments (e.g. real-world to simulated kitchen) and agent embodiments (e.g. bimanual human demonstration to robotic arm). We evaluate on three challenging cross-domain learning problems and match the performance of demonstration-accelerated RL approaches that require in-domain demonstrations. In a simulated kitchen environment, our approach learns long-horizon robot manipulation tasks, using less than 3 minutes of human video demonstrations from a real-world kitchen. This enables scaling robot learning via the reuse of demonstrations, e.g. collected as human videos, for learning in any number of target domains.

View on arXiv PDF

Similar