ROLGJan 20, 2022

DROPO: Sim-to-Real Transfer with Offline Domain Randomization

arXiv:2201.08434v248 citations
AI Analysis

This addresses the problem of safe and efficient sim-to-real transfer for robotic manipulation, though it is incremental as it builds on existing domain randomization techniques.

The paper tackles the difficulty of finding optimal randomization distributions for sim-to-real transfer in robotic reinforcement learning by introducing DROPO, a method that estimates these distributions using only a limited offline dataset and models parameter uncertainty, resulting in successful domain transfer and improved performance over prior methods.

In recent years, domain randomization over dynamics parameters has gained a lot of traction as a method for sim-to-real transfer of reinforcement learning policies in robotic manipulation; however, finding optimal randomization distributions can be difficult. In this paper, we introduce DROPO, a novel method for estimating domain randomization distributions for safe sim-to-real transfer. Unlike prior work, DROPO only requires a limited, precollected offline dataset of trajectories, and explicitly models parameter uncertainty to match real data using a likelihood-based approach. We demonstrate that DROPO is capable of recovering dynamic parameter distributions in simulation and finding a distribution capable of compensating for an unmodeled phenomenon. We also evaluate the method in two zero-shot sim-to-real transfer scenarios, showing successful domain transfer and improved performance over prior methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes