CVMay 24, 2024

ExactDreamer: High-Fidelity Text-to-3D Content Creation via Exact Score Matching

arXiv:2405.15914v14 citationsh-index: 20
Originality Incremental advance
AI Analysis

This addresses high-fidelity 3D content creation for applications like gaming and VR, representing an incremental improvement over prior methods like SDS and ISM.

The paper tackles the problem of over-smoothing and reconstruction errors in text-to-3D generation by proposing Exact Score Matching (ESM), which uses auxiliary variables to ensure exact recovery in the DDIM reverse process, resulting in superior detailed generation compared to existing methods.

Text-to-3D content creation is a rapidly evolving research area. Given the scarcity of 3D data, current approaches often adapt pre-trained 2D diffusion models for 3D synthesis. Among these approaches, Score Distillation Sampling (SDS) has been widely adopted. However, the issue of over-smoothing poses a significant limitation on the high-fidelity generation of 3D models. To address this challenge, LucidDreamer replaces the Denoising Diffusion Probabilistic Model (DDPM) in SDS with the Denoising Diffusion Implicit Model (DDIM) to construct Interval Score Matching (ISM). However, ISM inevitably inherits inconsistencies from DDIM, causing reconstruction errors during the DDIM inversion process. This results in poor performance in the detailed generation of 3D objects and loss of content. To alleviate these problems, we propose a novel method named Exact Score Matching (ESM). Specifically, ESM leverages auxiliary variables to mathematically guarantee exact recovery in the DDIM reverse process. Furthermore, to effectively capture the dynamic changes of the original and auxiliary variables, the LoRA of a pre-trained diffusion model implements these exact paths. Extensive experiments demonstrate the effectiveness of ESM in text-to-3D generation, particularly highlighting its superiority in detailed generation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes