ROAILGMar 8, 2019

Pixel-Attentive Policy Gradient for Multi-Fingered Grasping in Cluttered Scenes

arXiv:1903.03227v437 citations
Originality Highly original
AI Analysis

This addresses the problem of sim-to-real gaps and high sample complexity in robotic grasping for researchers and practitioners, representing a novel method for a known bottleneck.

The paper tackled multi-fingered robotic grasping in cluttered scenes by developing a reinforcement learning policy that operates in pixel space and uses a novel attention mechanism, achieving a high grasp success rate in real-world transfer.

Recent advances in on-policy reinforcement learning (RL) methods enabled learning agents in virtual environments to master complex tasks with high-dimensional and continuous observation and action spaces. However, leveraging this family of algorithms in multi-fingered robotic grasping remains a challenge due to large sim-to-real fidelity gaps and the high sample complexity of on-policy RL algorithms. This work aims to bridge these gaps by first reinforcement-learning a multi-fingered robotic grasping policy in simulation that operates in the pixel space of the input: a single depth image. Using a mapping from pixel space to Cartesian space according to the depth map, this method transfers to the real world with high fidelity and introduces a novel attention mechanism that substantially improves grasp success rate in cluttered environments. Finally, the direct-generative nature of this method allows learning of multi-fingered grasps that have flexible end-effector positions, orientations and rotations, as well as all degrees of freedom of the hand.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes