RO AI LGMar 8, 2019

Pixel-Attentive Policy Gradient for Multi-Fingered Grasping in Cluttered Scenes

Bohan Wu, Iretiayo Akinola, Peter K. Allen

arXiv:1903.03227v410.937 citations

Originality Highly original

AI Analysis

This addresses the problem of sim-to-real gaps and high sample complexity in robotic grasping for researchers and practitioners, representing a novel method for a known bottleneck.

The paper tackled multi-fingered robotic grasping in cluttered scenes by developing a reinforcement learning policy that operates in pixel space and uses a novel attention mechanism, achieving a high grasp success rate in real-world transfer.

Recent advances in on-policy reinforcement learning (RL) methods enabled learning agents in virtual environments to master complex tasks with high-dimensional and continuous observation and action spaces. However, leveraging this family of algorithms in multi-fingered robotic grasping remains a challenge due to large sim-to-real fidelity gaps and the high sample complexity of on-policy RL algorithms. This work aims to bridge these gaps by first reinforcement-learning a multi-fingered robotic grasping policy in simulation that operates in the pixel space of the input: a single depth image. Using a mapping from pixel space to Cartesian space according to the depth map, this method transfers to the real world with high fidelity and introduces a novel attention mechanism that substantially improves grasp success rate in cluttered environments. Finally, the direct-generative nature of this method allows learning of multi-fingered grasps that have flexible end-effector positions, orientations and rotations, as well as all degrees of freedom of the hand.

View on arXiv PDF

Similar