CVGRDec 16, 2024

MeshArt: Generating Articulated Meshes with Structure-Guided Transformers

arXiv:2412.11596v242 citationsh-index: 12CVPR
AI Analysis

This work addresses the need for creating interactive virtual assets in domains like gaming and simulation, representing an incremental advance in 3D mesh generation.

The paper tackles the problem of generating articulated 3D meshes with realistic and functional structures, achieving a 57.1% improvement in structure coverage and a 209-point improvement in mesh generation FID over state-of-the-art methods.

Articulated 3D object generation is fundamental for creating realistic, functional, and interactable virtual assets which are not simply static. We introduce MeshArt, a hierarchical transformer-based approach to generate articulated 3D meshes with clean, compact geometry, reminiscent of human-crafted 3D models. We approach articulated mesh generation in a part-by-part fashion across two stages. First, we generate a high-level articulation-aware object structure; then, based on this structural information, we synthesize each part's mesh faces. Key to our approach is modeling both articulation structures and part meshes as sequences of quantized triangle embeddings, leading to a unified hierarchical framework with transformers for autoregressive generation. Object part structures are first generated as their bounding primitives and articulation modes; a second transformer, guided by these articulation structures, then generates each part's mesh triangles. To ensure coherency among generated parts, we introduce structure-guided conditioning that also incorporates local part mesh connectivity. MeshArt shows significant improvements over state of the art, with 57.1% improvement in structure coverage and a 209-point improvement in mesh generation FID.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes