ROLGMLApr 29, 2021

On the Emergence of Whole-body Strategies from Humanoid Robot Push-recovery Learning

arXiv:2104.14534v12 citations
Originality Synthesis-oriented
AI Analysis

This addresses the challenge of developing general and robust push-recovery strategies for humanoid robots, which is incremental by applying existing methods to a specific domain.

The researchers tackled the problem of enabling humanoid robots to balance and recover from pushes using a model-free deep reinforcement learning approach, resulting in a policy that learns robust whole-body behaviors validated on the iCub humanoid in simulation.

Balancing and push-recovery are essential capabilities enabling humanoid robots to solve complex locomotion tasks. In this context, classical control systems tend to be based on simplified physical models and hard-coded strategies. Although successful in specific scenarios, this approach requires demanding tuning of parameters and switching logic between specifically-designed controllers for handling more general perturbations. We apply model-free Deep Reinforcement Learning for training a general and robust humanoid push-recovery policy in a simulation environment. Our method targets high-dimensional whole-body humanoid control and is validated on the iCub humanoid. Reward components incorporating expert knowledge on humanoid control enable fast learning of several robust behaviors by the same policy, spanning the entire body. We validate our method with extensive quantitative analyses in simulation, including out-of-sample tasks which demonstrate policy robustness and generalization, both key requirements towards real-world robot deployment.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes