ROCVApr 26, 2025

Dexonomy: Synthesizing All Dexterous Grasp Types in a Grasp Taxonomy

arXiv:2504.18829v223 citationsh-index: 6Robotics
Originality Incremental advance
AI Analysis

This work addresses the problem of scalable dexterous grasping for robotics by enabling synthesis across all grasp types in a taxonomy, which is incremental as it builds on existing grasp synthesis methods with a novel two-stage optimization approach.

The paper tackles the challenge of synthesizing diverse dexterous grasps for robots by proposing an efficient pipeline that generates contact-rich, physically plausible grasps for any object and grasp type, resulting in a dataset of 9.5M grasps and an 82.3% success rate in real-world experiments.

Generalizable dexterous grasping with suitable grasp types is a fundamental skill for intelligent robots. Developing such skills requires a large-scale and high-quality dataset that covers numerous grasp types (i.e., at least those categorized by the GRASP taxonomy), but collecting such data is extremely challenging. Existing automatic grasp synthesis methods are often limited to specific grasp types or object categories, hindering scalability. This work proposes an efficient pipeline capable of synthesizing contact-rich, penetration-free, and physically plausible grasps for any grasp type, object, and articulated hand. Starting from a single human-annotated template for each hand and grasp type, our pipeline tackles the complicated synthesis problem with two stages: optimize the object to fit the hand template first, and then locally refine the hand to fit the object in simulation. To validate the synthesized grasps, we introduce a contact-aware control strategy that allows the hand to apply the appropriate force at each contact point to the object. Those validated grasps can also be used as new grasp templates to facilitate future synthesis. Experiments show that our method significantly outperforms previous type-unaware grasp synthesis baselines in simulation. Using our algorithm, we construct a dataset containing 10.7k objects and 9.5M grasps, covering 31 grasp types in the GRASP taxonomy. Finally, we train a type-conditional generative model that successfully performs the desired grasp type from single-view object point clouds, achieving an 82.3% success rate in real-world experiments. Project page: https://pku-epic.github.io/Dexonomy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes