ROJun 7, 2021

Learning to Detect Multi-Modal Grasps for Dexterous Grasping in Dense Clutter

arXiv:2106.03919v212 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of reliable robotic grasping in dense clutter, though it appears incremental as it builds on existing grasp detection methods by adding multi-modal capabilities.

The paper tackles the problem of dexterous grasping in cluttered environments by proposing a multi-modal grasp detection approach that predicts success probabilities for different grasp types from point clouds, achieving an 8.5% higher object retrieval rate compared to baselines.

We propose an approach to multi-modal grasp detection that jointly predicts the probabilities that several types of grasps succeed at a given grasp pose. Given a partial point cloud of a scene, the algorithm proposes a set of feasible grasp candidates, then estimates the probabilities that a grasp of each type would succeed at each candidate pose. Predicting grasp success probabilities directly from point clouds makes our approach agnostic to the number and placement of depth sensors at execution time. We evaluate our system both in simulation and on a real robot with a Robotiq 3-Finger Adaptive Gripper and compare our network against several baselines that perform fewer types of grasps. Our experiments show that a system that explicitly models grasp type achieves an object retrieval rate 8.5% higher in a complex cluttered environment than our highest-performing baseline.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes