LGAICVROOct 28, 2021

Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives

arXiv:2110.15360v1118 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of slow and inefficient training in robotic reinforcement learning for researchers and practitioners, offering a simple yet effective interface change that is incremental but impactful.

The paper tackles the challenge of exploration in robotic reinforcement learning by introducing manually specified, parameterized action primitives, which improve learning efficiency and task performance across diverse domains, significantly outperforming prior methods that rely on offline expert data.

Despite the potential of reinforcement learning (RL) for building general-purpose robotic systems, training RL agents to solve robotics tasks still remains challenging due to the difficulty of exploration in purely continuous action spaces. Addressing this problem is an active area of research with the majority of focus on improving RL methods via better optimization or more efficient exploration. An alternate but important component to consider improving is the interface of the RL algorithm with the robot. In this work, we manually specify a library of robot action primitives (RAPS), parameterized with arguments that are learned by an RL policy. These parameterized primitives are expressive, simple to implement, enable efficient exploration and can be transferred across robots, tasks and environments. We perform a thorough empirical study across challenging tasks in three distinct domains with image input and a sparse terminal reward. We find that our simple change to the action interface substantially improves both the learning efficiency and task performance irrespective of the underlying RL algorithm, significantly outperforming prior methods which learn skills from offline expert data. Code and videos at https://mihdalal.github.io/raps/

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes