LGAINov 22, 2016

Variational Intrinsic Control

arXiv:1611.07507v1476 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of enabling agents to autonomously learn diverse skills without external rewards, which is incremental as it builds on prior intrinsic control methods.

The paper tackles the problem of discovering intrinsic options in unsupervised reinforcement learning by maximizing the mutual information between options and termination states, resulting in scalable algorithms that demonstrate applicability across various tasks.

In this paper we introduce a new unsupervised reinforcement learning method for discovering the set of intrinsic options available to an agent. This set is learned by maximizing the number of different states an agent can reliably reach, as measured by the mutual information between the set of options and option termination states. To this end, we instantiate two policy gradient based algorithms, one that creates an explicit embedding space of options and one that represents options implicitly. The algorithms also provide an explicit measure of empowerment in a given state that can be used by an empowerment maximizing agent. The algorithm scales well with function approximation and we demonstrate the applicability of the algorithm on a range of tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes