LG AIMay 22, 2019

COBRA: Data-Efficient Model-Based RL through Unsupervised Object Discovery and Curiosity-Driven Exploration

Nicholas Watters, Loic Matthey, Matko Bosnjak, Christopher P. Burgess, Alexander Lerchner

arXiv:1905.09275v225.7132 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses data efficiency and robustness challenges for reinforcement learning in continuous control, though it appears incremental as it builds on existing model-based and unsupervised learning approaches.

The paper tackles data inefficiency and robustness to task-irrelevant perturbations in deep reinforcement learning by introducing COBRA, which uses unsupervised object discovery and curiosity-driven exploration to build object-based models, enabling learning of various tasks in very few steps and excelling in robustness tests.

Data efficiency and robustness to task-irrelevant perturbations are long-standing challenges for deep reinforcement learning algorithms. Here we introduce a modular approach to addressing these challenges in a continuous control environment, without using hand-crafted or supervised information. Our Curious Object-Based seaRch Agent (COBRA) uses task-free intrinsically motivated exploration and unsupervised learning to build object-based models of its environment and action space. Subsequently, it can learn a variety of tasks through model-based search in very few steps and excel on structured hold-out tests of policy robustness.

View on arXiv PDF Code

Similar