AIOct 15, 2018

CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning

arXiv:1810.06284v5190 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of scalable and adaptive learning for autonomous agents in complex, unstructured environments, representing an incremental advance in multi-goal reinforcement learning.

The paper tackles the problem of autonomous goal-setting and curriculum learning in open-ended environments by proposing CURIOUS, an algorithm that uses modular value functions and automated curriculum learning to achieve diverse goals, resulting in robust self-organized learning and demonstrated resilience to distractions and changes.

In open-ended environments, autonomous learning agents must set their own goals and build their own curriculum through an intrinsically motivated exploration. They may consider a large diversity of goals, aiming to discover what is controllable in their environments, and what is not. Because some goals might prove easy and some impossible, agents must actively select which goal to practice at any moment, to maximize their overall mastery on the set of learnable goals. This paper proposes CURIOUS, an algorithm that leverages 1) a modular Universal Value Function Approximator with hindsight learning to achieve a diversity of goals of different kinds within a unique policy and 2) an automated curriculum learning mechanism that biases the attention of the agent towards goals maximizing the absolute learning progress. Agents focus sequentially on goals of increasing complexity, and focus back on goals that are being forgotten. Experiments conducted in a new modular-goal robotic environment show the resulting developmental self-organization of a learning curriculum, and demonstrate properties of robustness to distracting goals, forgetting and changes in body properties.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes