ROCVJul 8, 2024

TARGO: Benchmarking Target-driven Object Grasping under Occlusions

arXiv:2407.06168v17 citationsh-index: 7
Originality Incremental advance
AI Analysis

This addresses the challenge of target-driven grasping in cluttered environments for robotics, but it is incremental as it builds on existing grasping models with a new benchmark and method.

The authors tackled the problem of robotic grasping under occlusions by establishing a new benchmark dataset (TARGO) and found that current state-of-the-art models struggle as occlusion increases, with their proposed TARGO-Net performing most robustly.

Recent advances in predicting 6D grasp poses from a single depth image have led to promising performance in robotic grasping. However, previous grasping models face challenges in cluttered environments where nearby objects impact the target object's grasp. In this paper, we first establish a new benchmark dataset for TARget-driven Grasping under Occlusions, named TARGO. We make the following contributions: 1) We are the first to study the occlusion level of grasping. 2) We set up an evaluation benchmark consisting of large-scale synthetic data and part of real-world data, and we evaluated five grasp models and found that even the current SOTA model suffers when the occlusion level increases, leaving grasping under occlusion still a challenge. 3) We also generate a large-scale training dataset via a scalable pipeline, which can be used to boost the performance of grasping under occlusion and generalized to the real world. 4) We further propose a transformer-based grasping model involving a shape completion module, termed TARGO-Net, which performs most robustly as occlusion increases. Our benchmark dataset can be found at https://TARGO-benchmark.github.io/.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes