AILGAug 7, 2017

Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning

arXiv:1708.02190v3206 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of autonomous skill acquisition in robotics and AI, offering a novel approach to curriculum learning, though it appears incremental in building on prior intrinsic motivation concepts.

The paper tackles the problem of enabling autonomous developmental learning in machines by introducing Intrinsically Motivated Goal Exploration Processes (IMGEP), which automatically generate a learning curriculum without external goals, resulting in the discovery of diverse skills, such as nested tool use, in experiments including a real humanoid robot exploring hundreds of continuous dimensions.

Intrinsically motivated spontaneous exploration is a key enabler of autonomous developmental learning in human children. It enables the discovery of skill repertoires through autotelic learning, i.e. the self-generation, self-selection, self-ordering and self-experimentation of learning goals. We present an algorithmic approach called Intrinsically Motivated Goal Exploration Processes (IMGEP) to enable similar properties of autonomous learning in machines. The IMGEP architecture relies on several principles: 1) self-generation of goals, generalized as parameterized fitness functions; 2) selection of goals based on intrinsic rewards; 3) exploration with incremental goal-parameterized policy search and exploitation with a batch learning algorithm; 4) systematic reuse of information acquired when targeting a goal for improving towards other goals. We present a particularly efficient form of IMGEP, called AMB, that uses a population-based policy and an object-centered spatio-temporal modularity. We provide several implementations of this architecture and demonstrate their ability to automatically generate a learning curriculum within several experimental setups. One of these experiments includes a real humanoid robot exploring multiple spaces of goals with several hundred continuous dimensions and with distractors. While no particular target goal is provided to these autotelic agents, this curriculum allows the discovery of diverse skills that act as stepping stones for learning more complex skills, e.g. nested tool use.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes