CVAIApr 9, 2024

Magic-Boost: Boost 3D Generation with Multi-View Conditioned Diffusion

arXiv:2404.06429v32 citationsh-index: 20
Originality Incremental advance
AI Analysis

This addresses the issue of incorrect geometries and blurry textures in 3D content generation for applications like gaming or VR, representing an incremental improvement over existing methods.

The paper tackles the problem of low-quality 3D asset generation from multi-view images by proposing Magic-Boost, a method that refines coarse results using a multi-view conditioned diffusion model and iterative-update strategy, achieving high-quality 3D assets with rich details in about 15 minutes.

Benefiting from the rapid development of 2D diffusion models, 3D content generation has witnessed significant progress. One promising solution is to finetune the pre-trained 2D diffusion models to produce multi-view images and then reconstruct them into 3D assets via feed-forward sparse-view reconstruction models. However, limited by the 3D inconsistency in the generated multi-view images and the low reconstruction resolution of the feed-forward reconstruction models, the generated 3d assets are still limited to incorrect geometries and blurry textures. To address this problem, we present a multi-view based refine method, named Magic-Boost, to further refine the generation results. In detail, we first propose a novel multi-view conditioned diffusion model which extracts 3d prior from the synthesized multi-view images to synthesize high-fidelity novel view images and then introduce a novel iterative-update strategy to adopt it to provide precise guidance to refine the coarse generated results through a fast optimization process. Conditioned on the strong 3d priors extracted from the synthesized multi-view images, Magic-Boost is capable of providing precise optimization guidance that well aligns with the coarse generated 3D assets, enriching the local detail in both geometry and texture within a short time ($\sim15$min). Extensive experiments show Magic-Boost greatly enhances the coarse generated inputs, generates high-quality 3D assets with rich geometric and textural details. (Project Page: https://magic-research.github.io/magic-boost/)

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes