ROAILGDec 5, 2022

Accelerating Interactive Human-like Manipulation Learning with GPU-based Simulation and High-quality Demonstrations

arXiv:2212.02126v118 citationsh-index: 57
Originality Incremental advance
AI Analysis

This work addresses the problem of enabling humanoid robots to perform skillful closed-loop manipulation in unstructured environments, representing an incremental improvement through hybrid methods.

The paper tackles the challenge of dexterous manipulation with anthropomorphic robot hands by introducing a framework that combines GPU-based simulation and imitation learning to reduce the interaction data requirements of reinforcement learning, resulting in robust and natural behaviors demonstrated in tasks of daily living.

Dexterous manipulation with anthropomorphic robot hands remains a challenging problem in robotics because of the high-dimensional state and action spaces and complex contacts. Nevertheless, skillful closed-loop manipulation is required to enable humanoid robots to operate in unstructured real-world environments. Reinforcement learning (RL) has traditionally imposed enormous interaction data requirements for optimizing such complex control problems. We introduce a new framework that leverages recent advances in GPU-based simulation along with the strength of imitation learning in guiding policy search towards promising behaviors to make RL training feasible in these domains. To this end, we present an immersive virtual reality teleoperation interface designed for interactive human-like manipulation on contact rich tasks and a suite of manipulation environments inspired by tasks of daily living. Finally, we demonstrate the complementary strengths of massively parallel RL and imitation learning, yielding robust and natural behaviors. Videos of trained policies, our source code, and the collected demonstration datasets are available at https://maltemosbach.github.io/interactive_ human_like_manipulation/.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes