LGRODec 14, 2022

Cross-Domain Transfer via Semantic Skill Imitation

arXiv:2212.07407v127 citationsh-index: 85
Originality Highly original
AI Analysis

This enables scaling robot learning by reusing demonstrations across domains, addressing a bottleneck in data collection for robotics.

The paper tackles the problem of accelerating reinforcement learning in a target domain by using demonstrations from a different source domain, such as human videos for robotic tasks, achieving performance comparable to methods requiring in-domain demonstrations with less than 3 minutes of video data.

We propose an approach for semantic imitation, which uses demonstrations from a source domain, e.g. human videos, to accelerate reinforcement learning (RL) in a different target domain, e.g. a robotic manipulator in a simulated kitchen. Instead of imitating low-level actions like joint velocities, our approach imitates the sequence of demonstrated semantic skills like "opening the microwave" or "turning on the stove". This allows us to transfer demonstrations across environments (e.g. real-world to simulated kitchen) and agent embodiments (e.g. bimanual human demonstration to robotic arm). We evaluate on three challenging cross-domain learning problems and match the performance of demonstration-accelerated RL approaches that require in-domain demonstrations. In a simulated kitchen environment, our approach learns long-horizon robot manipulation tasks, using less than 3 minutes of human video demonstrations from a real-world kitchen. This enables scaling robot learning via the reuse of demonstrations, e.g. collected as human videos, for learning in any number of target domains.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes