LG AIJun 12, 2024

When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions

Zhening Li, Gabriel Poesia, Armando Solar-Lezama

arXiv:2406.07897v14.61 citationsHas Code

Originality Incremental advance

AI Analysis

This work provides foundational insights for researchers in hierarchical reinforcement learning, helping guide skill discovery and usage decisions, though it is incremental in refining theoretical understanding.

The paper tackles the problem of characterizing when temporal abstractions (skills) improve reinforcement learning performance, showing theoretically and empirically that skills provide less benefit in environments with less compressible solutions and that they aid exploration more than learning from experience.

Skills are temporal abstractions that are intended to improve reinforcement learning (RL) performance through hierarchical RL. Despite our intuition about the properties of an environment that make skills useful, a precise characterization has been absent. We provide the first such characterization, focusing on the utility of deterministic skills in deterministic sparse-reward environments with finite action spaces. We show theoretically and empirically that RL performance gain from skills is worse in environments where solutions to states are less compressible. Additional theoretical results suggest that skills benefit exploration more than they benefit learning from existing experience, and that using unexpressive skills such as macroactions may worsen RL performance. We hope our findings can guide research on automatic skill discovery and help RL practitioners better decide when and how to use skills.

View on arXiv PDF Code

Similar