LG AI ML | Jan 1, 2020

Options of Interest: Temporal Abstraction with Interest Functions

Khimya Khetarpal, Martin Klissarov, Maxime Chevalier-Boisvert, Pierre-Luc Bacon, Doina Precup

arXiv:2001.00271v1 | 18.952 citations | Has Code

Originality: Incremental advance

AI Analysis

This work addresses a key bottleneck in option discovery for reinforcement learning agents, offering an incremental improvement by enabling more interpretable and reusable temporal abstractions.

The paper tackles the challenge of learning initiation sets for temporal abstraction in reinforcement learning by introducing interest functions as a generalization suitable for function approximation, and demonstrates the approach's effectiveness in discrete and continuous environments with quantitative and qualitative results.

Temporal abstraction refers to the ability of an agent to use behaviours of controllers which act for a limited, variable amount of time. The options framework describes such behaviours as consisting of a subset of states in which they can initiate, an internal policy and a stochastic termination condition. However, much of the subsequent work on option discovery has ignored the initiation set, because of difficulty in learning it from data. We provide a generalization of initiation sets suitable for general function approximation, by defining an interest function associated with an option. We derive a gradient-based learning algorithm for interest functions, leading to a new interest-option-critic architecture. We investigate how interest functions can be leveraged to learn interpretable and reusable temporal abstractions. We demonstrate the efficacy of the proposed approach through quantitative and qualitative results, in both discrete and continuous environments.
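As a rough illustration of how an interest function can generalize an initiation set, the sketch below reweights a policy over options by a per-option interest value in [0, 1] and renormalizes. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names (`softmax`, `sigmoid`, `interest_weighted_policy`), the sigmoid parameterization of interest, and the example numbers are all hypothetical, and the gradient-based learning of the interest function is not shown.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    """Numerically stable logistic function, used here to keep interest in [0, 1]."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def interest_weighted_policy(option_probs, interest):
    """Reweight a policy over options by per-option interest and renormalize.

    option_probs: probabilities over options in the current state (sum to 1).
    interest:     one value in [0, 1] per option for the current state.
                  Values restricted to {0, 1} recover a hard initiation set.
    """
    weighted = [i * p for i, p in zip(interest, option_probs)]
    total = sum(weighted)
    if total == 0.0:
        # Assumption: at least one option must have nonzero interest.
        raise ValueError("no option has nonzero interest in this state")
    return [w / total for w in weighted]


# Hypothetical example: three options, uniform base policy.
base = softmax([0.0, 0.0, 0.0])

# Hard initiation set: option 2 cannot start in this state.
hard = interest_weighted_policy(base, [1.0, 1.0, 0.0])

# Soft interest produced by a differentiable parameterization.
soft = interest_weighted_policy(base, [sigmoid(z) for z in (2.0, 0.0, -2.0)])
```

Because the soft interest values come from a differentiable function, they can in principle be trained with gradients, which is what makes this form compatible with function approximation where a binary initiation set is not.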
