ROMay 31

GraspGen-X: Cross-Embodiment 6-DOF Diffusion-based Grasping

arXiv:2606.0099830.8
Predicted impact top 14% in RO · last 90 daysOriginality Incremental advance
AI Analysis

For robotics, this enables grasping with diverse grippers without retraining, addressing a key bottleneck in robot manipulation.

The paper tackles cross-embodiment 6-DOF grasping, requiring generalization to novel grippers and objects. Their diffusion-based model, trained on 2 billion grasps, achieves best zero-shot generalization to novel real-world grippers and objects over baselines.

We study cross-embodiment 6-DOF robot grasping. Unlike prior works, we require the model not only to generalize to novel objects / scenes but also to novel gripper morphologies and physical grasping processes. Our method extends diffusion model based generative 6-DOF grasping models to condition on the additional gripper's representation. We propose a swept-volume heuristic for encoding the gripper. We train our cross-embodiment model with procedural grippers and a large-scale dataset of 2 Billion grasps. In simulation experiments, our model has the best zero-shot generalization to novel real-world grippers and objects over baseline methods. Our model also serves as a good initialization for fine-tuning to adapt to novel grippers. In ablations, we demonstrate the efficiency of our sweep-volume gripper representation and our procedural gripper training dataset. Last, we show zero-shot generalization to real-world novel grippers for 6-DOF grasping, surpassing baselines in cross-embodiment generalization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes