LGAIMay 22, 2019

COBRA: Data-Efficient Model-Based RL through Unsupervised Object Discovery and Curiosity-Driven Exploration

arXiv:1905.09275v2132 citations
AI Analysis

This addresses data efficiency and robustness challenges for reinforcement learning in continuous control, though it appears incremental as it builds on existing model-based and unsupervised learning approaches.

The paper tackles data inefficiency and robustness to task-irrelevant perturbations in deep reinforcement learning by introducing COBRA, which uses unsupervised object discovery and curiosity-driven exploration to build object-based models, enabling learning of various tasks in very few steps and excelling in robustness tests.

Data efficiency and robustness to task-irrelevant perturbations are long-standing challenges for deep reinforcement learning algorithms. Here we introduce a modular approach to addressing these challenges in a continuous control environment, without using hand-crafted or supervised information. Our Curious Object-Based seaRch Agent (COBRA) uses task-free intrinsically motivated exploration and unsupervised learning to build object-based models of its environment and action space. Subsequently, it can learn a variety of tasks through model-based search in very few steps and excel on structured hold-out tests of policy robustness.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes