MeshMosaic: Scaling Artist Mesh Generation via Local-to-Global Assembly
This work addresses the problem of generating high-resolution artist meshes for practical applications, a concrete advance in 3D mesh generation.
The paper tackles the challenge of scaling artist-designed meshes to high triangle counts by introducing MeshMosaic, a local-to-global framework that generates meshes with over 100K triangles. It significantly outperforms prior methods, which are limited to around 8K faces, in geometric fidelity and user preference.
Scaling artist-designed meshes to high triangle counts remains challenging for autoregressive generative models. Existing transformer-based methods suffer from long-sequence bottlenecks, caused by the large number of tokens required, and from coarse quantization granularity. These issues prevent faithful reproduction of fine geometric details and structured density patterns. We introduce MeshMosaic, a novel local-to-global framework for artist mesh generation that scales to over 100K triangles--substantially surpassing prior methods, which typically handle only around 8K faces. MeshMosaic first segments shapes into patches, then generates each patch autoregressively, leveraging shared boundary conditions to promote coherence, symmetry, and seamless connectivity between neighboring regions. Because each patch is quantized individually, the same number of quantization levels covers a smaller spatial extent, which improves scalability to high-resolution meshes and yields more symmetrical and organized mesh density and structure. Extensive experiments across multiple public datasets demonstrate that MeshMosaic significantly outperforms state-of-the-art methods in both geometric fidelity and user preference, supporting superior detail representation and practical mesh generation for real-world applications.
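To illustrate why quantizing patches individually can preserve finer detail, the following minimal sketch compares the round-trip error of quantizing a small patch against the bounding box of the whole shape with quantizing it against its own bounding box. This is not the paper's implementation: the uniform grid quantizer, the choice of 128 bins, the sample coordinates, and all function names are assumptions made for illustration only.

```python
from typing import List, Tuple

Point = Tuple[float, float, float]


def bbox(points: List[Point]) -> Tuple[Point, Point]:
    """Axis-aligned bounding box (lower corner, upper corner)."""
    lo = tuple(min(p[i] for p in points) for i in range(3))
    hi = tuple(max(p[i] for p in points) for i in range(3))
    return lo, hi


def quantize(points, lo, hi, bins):
    """Map each coordinate to an integer code in [0, bins - 1]."""
    codes = []
    for p in points:
        code = []
        for i in range(3):
            extent = hi[i] - lo[i]
            if extent == 0:
                code.append(0)  # degenerate axis: a single level suffices
            else:
                t = (p[i] - lo[i]) / extent
                code.append(min(bins - 1, max(0, round(t * (bins - 1)))))
        codes.append(tuple(code))
    return codes


def dequantize(codes, lo, hi, bins):
    """Map integer codes back to coordinates inside the bounding box."""
    return [
        tuple(lo[i] + c[i] / (bins - 1) * (hi[i] - lo[i]) for i in range(3))
        for c in codes
    ]


def roundtrip_error(points, lo, hi, bins):
    """Largest per-coordinate error after quantizing and dequantizing."""
    restored = dequantize(quantize(points, lo, hi, bins), lo, hi, bins)
    return max(abs(a - b) for p, q in zip(points, restored) for a, b in zip(p, q))


BINS = 128  # hypothetical number of quantization levels per axis

# Hypothetical data: a small patch occupying one corner of a unit-cube shape.
patch = [(0.0, 0.0, 0.0), (0.1, 0.1, 0.1), (0.0503, 0.0212, 0.0771)]
whole_shape = patch + [(1.0, 1.0, 1.0)]

# Global quantization: the patch shares one grid with the entire shape.
g_lo, g_hi = bbox(whole_shape)
global_err = roundtrip_error(patch, g_lo, g_hi, BINS)

# Per-patch quantization: the same number of bins spans only the patch.
p_lo, p_hi = bbox(patch)
local_err = roundtrip_error(patch, p_lo, p_hi, BINS)

print(f"global quantization error:    {global_err:.6f}")
print(f"per-patch quantization error: {local_err:.6f}")
```

In this toy setup the patch spans one tenth of the shape along each axis, so the per-patch grid step is ten times smaller for the same number of bins. The trade-off is that each patch's bounding box must be carried alongside its codes so the patch can be placed back in the global frame.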