AILGROAug 13, 2020

Visuomotor Mechanical Search: Learning to Retrieve Target Objects in Clutter

arXiv:2008.06073v152 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of robotic object retrieval in cluttered environments, which is incremental as it builds on existing deep RL methods with specific enhancements for sample efficiency and effectiveness.

The paper tackles the problem of retrieving target objects occluded by clutter using a novel deep reinforcement learning procedure that combines teacher-aided exploration, a critic with privileged information, and mid-level representations, resulting in faster training and more efficient uncovering solutions than baselines, with an average improvement in graspability of the target object.

When searching for objects in cluttered environments, it is often necessary to perform complex interactions in order to move occluding objects out of the way and fully reveal the object of interest and make it graspable. Due to the complexity of the physics involved and the lack of accurate models of the clutter, planning and controlling precise predefined interactions with accurate outcome is extremely hard, when not impossible. In problems where accurate (forward) models are lacking, Deep Reinforcement Learning (RL) has shown to be a viable solution to map observations (e.g. images) to good interactions in the form of close-loop visuomotor policies. However, Deep RL is sample inefficient and fails when applied directly to the problem of unoccluding objects based on images. In this work we present a novel Deep RL procedure that combines i) teacher-aided exploration, ii) a critic with privileged information, and iii) mid-level representations, resulting in sample efficient and effective learning for the problem of uncovering a target object occluded by a heap of unknown objects. Our experiments show that our approach trains faster and converges to more efficient uncovering solutions than baselines and ablations, and that our uncovering policies lead to an average improvement in the graspability of the target object, facilitating downstream retrieval applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes