RO LGOct 17, 2017

Domain Randomization and Generative Models for Robotic Grasping

Joshua Tobin, Lukas Biewald, Rocky Duan, Marcin Andrychowicz, Ankur Handa, Vikash Kumar, Bob McGrew, Jonas Schneider, Peter Welinder, Wojciech Zaremba, Pieter Abbeel

arXiv:1710.06425v232.6188 citations

Originality Highly original

AI Analysis

This addresses the challenge of generalization in robotic grasping for applications requiring robust manipulation with minimal real-world data.

The paper tackled the problem of limited generalization in deep learning-based robotic grasping by developing a data generation pipeline using domain randomization and an autoregressive grasp planning model, achieving over 90% success rate on unseen objects in simulation and 80% in real-world tests.

Deep learning-based robotic grasping has made significant progress thanks to algorithmic improvements and increased data availability. However, state-of-the-art models are often trained on as few as hundreds or thousands of unique object instances, and as a result generalization can be a challenge. In this work, we explore a novel data generation pipeline for training a deep neural network to perform grasp planning that applies the idea of domain randomization to object synthesis. We generate millions of unique, unrealistic procedurally generated objects, and train a deep neural network to perform grasp planning on these objects. Since the distribution of successful grasps for a given object can be highly multimodal, we propose an autoregressive grasp planning model that maps sensor inputs of a scene to a probability distribution over possible grasps. This model allows us to sample grasps efficiently at test time (or avoid sampling entirely). We evaluate our model architecture and data generation pipeline in simulation and the real world. We find we can achieve a $>$90% success rate on previously unseen realistic objects at test time in simulation despite having only been trained on random objects. We also demonstrate an 80% success rate on real-world grasp attempts despite having only been trained on random simulated objects.

View on arXiv PDF

Similar