CVAIGRMar 23, 2023

DreamBooth3D: Subject-Driven Text-to-3D Generation

arXiv:2303.13508v2283 citationsh-index: 59
AI Analysis

This addresses the need for efficient subject-driven 3D generation in graphics and AI, though it is incremental as it builds on existing methods.

The paper tackles the problem of generating personalized 3D assets from a few images by combining DreamBooth and DreamFusion, overcoming overfitting through a 3-stage optimization to produce high-quality, subject-specific 3D models with novel attributes.

We present DreamBooth3D, an approach to personalize text-to-3D generative models from as few as 3-6 casually captured images of a subject. Our approach combines recent advances in personalizing text-to-image models (DreamBooth) with text-to-3D generation (DreamFusion). We find that naively combining these methods fails to yield satisfactory subject-specific 3D assets due to personalized text-to-image models overfitting to the input viewpoints of the subject. We overcome this through a 3-stage optimization strategy where we jointly leverage the 3D consistency of neural radiance fields together with the personalization capability of text-to-image models. Our method can produce high-quality, subject-specific 3D assets with text-driven modifications such as novel poses, colors and attributes that are not seen in any of the input images of the subject.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes