ROAICVJul 5, 2021

GraspME -- Grasp Manifold Estimator

arXiv:2107.01836v1
Originality Incremental advance
AI Analysis

This addresses the need for robots to have graspability models for autonomous manipulation tasks, though it appears incremental as it builds on existing methods like Mask R-CNN.

The paper tackles the problem of detecting grasp affordances for objects directly from 2D camera images by introducing GraspME, which estimates grasp manifolds via key points using a Mask R-CNN backbone, achieving an inference speed of 11.5 fps, 94.5% average precision for keypoint estimation, and a mean pixel distance of 1.29.

In this paper, we introduce a Grasp Manifold Estimator (GraspME) to detect grasp affordances for objects directly in 2D camera images. To perform manipulation tasks autonomously it is crucial for robots to have such graspability models of the surrounding objects. Grasp manifolds have the advantage of providing continuously infinitely many grasps, which is not the case when using other grasp representations such as predefined grasp points. For instance, this property can be leveraged in motion optimization to define goal sets as implicit surface constraints in the robot configuration space. In this work, we restrict ourselves to the case of estimating possible end-effector positions directly from 2D camera images. To this extend, we define grasp manifolds via a set of key points and locate them in images using a Mask R-CNN backbone. Using learned features allows generalizing to different view angles, with potentially noisy images, and objects that were not part of the training set. We rely on simulation data only and perform experiments on simple and complex objects, including unseen ones. Our framework achieves an inference speed of 11.5 fps on a GPU, an average precision for keypoint estimation of 94.5% and a mean pixel distance of only 1.29. This shows that we can estimate the objects very well via bounding boxes and segmentation masks as well as approximate the correct grasp manifold's keypoint coordinates.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes