GRAICVMay 22

AssetGen: Deployable 3D Asset Generation at Interactive Speed

arXiv:2605.2613798.4
AI Analysis

It addresses the need for fast, deployable 3D asset generation for interactive workflows, particularly in gaming and AR/VR, where user experience and real-time performance are critical.

AssetGen generates high-quality 3D meshes with baked normals and color textures from a single image in 30 seconds (14 seconds for a faster variant), achieving competitive visual quality against commercial solutions while supporting real-time rendering and mobile deployment.

While 3D generation is progressing rapidly, recent work has often focused on obtaining high-resolution assets, leaving user experience and deployability as afterthoughts. We present AssetGen, a 3D generator that focuses instead on these two aspects. Given one reference image, in 30 seconds it produces a high-quality mesh with baked normals, a color texture, and a controlled polygon budget suitable for real-time rendering, including mobile use cases. The AssetGen Flash variant further reduces latency to 14 seconds for interactive and agentic creation loops. Our model generates the object geometry with a coarse-to-refine VecSet framework, which implements mesh simplification, cleaning, and normal baking on the GPU, and a fast parallel UV unwrapping. It then generates textures in a multi-view fashion, followed by backprojection and 3D inpainting. Model distillation, kernel optimization, and pipeline parallelization are co-designed to accelerate the system end-to-end. We introduce numerous automated and blind human evaluations and demonstrate competitive visual quality against leading commercial solutions in 30 seconds and preview-quality results in less than 15 seconds. The final result is a system that supports AI-assisted, deployable 3D content creation in interactive workflows.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes