ROLGMay 11, 2021

Composable Energy Policies for Reactive Motion Generation and Reinforcement Learning

arXiv:2105.04962v132 citations
Originality Incremental advance
AI Analysis

This addresses a modularity issue in robotics and AI for motion planning, offering an incremental improvement over existing summation-based methods.

The paper tackles the problem of conflicting behaviors in reactive motion generation when summing independent policies by introducing Composable Energy Policies (CEP), which optimizes over the product of stochastic policies to choose actions satisfying all objectives, and shows it adapts to reinforcement learning for hierarchical integration of prior distributions.

Reactive motion generation problems are usually solved by computing actions as a sum of policies. However, these policies are independent of each other and thus, they can have conflicting behaviors when summing their contributions together. We introduce Composable Energy Policies (CEP), a novel framework for modular reactive motion generation. CEP computes the control action by optimization over the product of a set of stochastic policies. This product of policies will provide a high probability to those actions that satisfy all the components and low probability to the others. Optimizing over the product of the policies avoids the detrimental effect of conflicting behaviors between policies choosing an action that satisfies all the objectives. Besides, we show that CEP naturally adapts to the Reinforcement Learning problem allowing us to integrate, in a hierarchical fashion, any distribution as prior, from multimodal distributions to non-smooth distributions and learn a new policy given them.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes