ROLGJul 7, 2023

Polybot: Training One Policy Across Robots While Embracing Variability

arXiv:2307.03719v150 citationsh-index: 93
Originality Incremental advance
AI Analysis

This work addresses the problem of scaling vision-based robotic manipulators to everyday scenarios by enabling reuse of large datasets across varied robotic platforms, which is incremental but practical.

The paper tackles the challenge of transferring manipulation skills across different robotic platforms by proposing a framework to train a single policy for deployment on multiple robots, demonstrating significant improvements in success rate and sample efficiency when using new task data on a different robot.

Reusing large datasets is crucial to scale vision-based robotic manipulators to everyday scenarios due to the high cost of collecting robotic datasets. However, robotic platforms possess varying control schemes, camera viewpoints, kinematic configurations, and end-effector morphologies, posing significant challenges when transferring manipulation skills from one platform to another. To tackle this problem, we propose a set of key design decisions to train a single policy for deployment on multiple robotic platforms. Our framework first aligns the observation and action spaces of our policy across embodiments via utilizing wrist cameras and a unified, but modular codebase. To bridge the remaining domain shift, we align our policy's internal representations across embodiments through contrastive learning. We evaluate our method on a dataset collected over 60 hours spanning 6 tasks and 3 robots with varying joint configurations and sizes: the WidowX 250S, the Franka Emika Panda, and the Sawyer. Our results demonstrate significant improvements in success rate and sample efficiency for our policy when using new task data collected on a different robot, validating our proposed design decisions. More details and videos can be found on our anonymized project website: https://sites.google.com/view/polybot-multirobot

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes