ROApr 24, 2019

Bayesian Gaussian mixture model for robotic policy imitation

arXiv:1904.10716v254 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of ensuring safe and appropriate robot behavior in unknown states during imitation learning, which is incremental as it builds on existing imitation methods with uncertainty quantification.

The paper tackles the problem of robotic policy imitation where small errors cause the robot to leave demonstrated states, by proposing a Bayesian method to quantify action uncertainty and fuse imitation with additional policies. The approach is validated on a Panda robot for three manipulation tasks in continuous domains.

A common approach to learn robotic skills is to imitate a demonstrated policy. Due to the compounding of small errors and perturbations, this approach may let the robot leave the states in which the demonstrations were provided. This requires the consideration of additional strategies to guarantee that the robot will behave appropriately when facing unknown states. We propose to use a Bayesian method to quantify the action uncertainty at each state. The proposed Bayesian method is simple to set up, computationally efficient, and can adapt to a wide range of problems. Our approach exploits the estimated uncertainty to fuse the imitation policy with additional policies. It is validated on a Panda robot with the imitation of three manipulation tasks in the continuous domain using different control input/state pairs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes