CVNov 26, 2025

UniArt: Unified 3D Representation for Generating 3D Articulated Objects with Open-Set Articulation

arXiv:2511.21887v12 citations
Originality Highly original
AI Analysis

This addresses the costly and difficult-to-scale manual construction of articulated 3D objects for realistic simulation and embodied robotics, representing a novel method rather than an incremental improvement.

The paper tackles the problem of generating articulated 3D objects from a single image, presenting UniArt, a diffusion-based framework that synthesizes such objects end-to-end, achieving state-of-the-art mesh quality and articulation accuracy on the PartNet-Mobility benchmark.

Articulated 3D objects play a vital role in realistic simulation and embodied robotics, yet manually constructing such assets remains costly and difficult to scale. In this paper, we present UniArt, a diffusion-based framework that directly synthesizes fully articulated 3D objects from a single image in an end-to-end manner. Unlike prior multi-stage techniques, UniArt establishes a unified latent representation that jointly encodes geometry, texture, part segmentation, and kinematic parameters. We introduce a reversible joint-to-voxel embedding, which spatially aligns articulation features with volumetric geometry, enabling the model to learn coherent motion behaviors alongside structural formation. Furthermore, we formulate articulation type prediction as an open-set problem, removing the need for fixed joint semantics and allowing generalization to novel joint categories and unseen object types. Experiments on the PartNet-Mobility benchmark demonstrate that UniArt achieves state-of-the-art mesh quality and articulation accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes