ROAICVOct 6, 2018

Robustness via Retrying: Closed-Loop Robotic Manipulation with Self-Supervised Learning

arXiv:1810.03043v165 citations
Originality Highly original
AI Analysis

This addresses the problem of autonomous robotic manipulation for robots, offering a novel approach to improve robustness in real-world scenarios.

The paper tackles the challenge of using predictive models for robotic control with raw image inputs by proposing a self-supervised learning method that enables robots to retry tasks continuously, achieving complex manipulation tasks like grasping and repositioning with a model trained on 160 hours of unlabeled data.

Prediction is an appealing objective for self-supervised learning of behavioral skills, particularly for autonomous robots. However, effectively utilizing predictive models for control, especially with raw image inputs, poses a number of major challenges. How should the predictions be used? What happens when they are inaccurate? In this paper, we tackle these questions by proposing a method for learning robotic skills from raw image observations, using only autonomously collected experience. We show that even an imperfect model can complete complex tasks if it can continuously retry, but this requires the model to not lose track of the objective (e.g., the object of interest). To enable a robot to continuously retry a task, we devise a self-supervised algorithm for learning image registration, which can keep track of objects of interest for the duration of the trial. We demonstrate that this idea can be combined with a video-prediction based controller to enable complex behaviors to be learned from scratch using only raw visual inputs, including grasping, repositioning objects, and non-prehensile manipulation. Our real-world experiments demonstrate that a model trained with 160 robot hours of autonomously collected, unlabeled data is able to successfully perform complex manipulation tasks with a wide range of objects not seen during training.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes