ROSep 4, 2019

Toward Sim-to-Real Directional Semantic Grasping

arXiv:1909.02075v35 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of precise object grasping from specific directions for robotics applications, but it is incremental as it builds on existing sim-to-real and reinforcement learning methods.

The paper tackles directional semantic grasping by using deep reinforcement learning with a DDQN and CEM to map RGB images to robot commands, trained entirely in simulation with domain randomization, showing promising results in both simulation and real-world tests.

We address the problem of directional semantic grasping, that is, grasping a specific object from a specific direction. We approach the problem using deep reinforcement learning via a double deep Q-network (DDQN) that learns to map downsampled RGB input images from a wrist-mounted camera to Q-values, which are then translated into Cartesian robot control commands via the cross-entropy method (CEM). The network is learned entirely on simulated data generated by a custom robot simulator that models both physical reality (contacts) and perceptual quality (high-quality rendering). The reality gap is bridged using domain randomization. The system is an example of end-to-end (mapping input monocular RGB images to output Cartesian motor commands) grasping of objects from multiple pre-defined object-centric orientations, such as from the side or top. We show promising results in both simulation and the real world, along with some challenges faced and the need for future research in this area.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes