CVAIGRApr 5, 2023

Generative Novel View Synthesis with 3D-Aware Diffusion Models

NVIDIA
arXiv:2304.02602v1319 citationsh-index: 76
Originality Highly original
AI Analysis

This addresses the challenge of 3D-aware novel view synthesis for computer vision applications, representing a novel method rather than an incremental improvement.

The paper tackles the problem of generating novel 3D views from a single image using a diffusion-based model that incorporates a 3D feature volume as a geometry prior, achieving state-of-the-art results on synthetic renderings and real-world objects.

We present a diffusion-based model for 3D-aware generative novel view synthesis from as few as a single input image. Our model samples from the distribution of possible renderings consistent with the input and, even in the presence of ambiguity, is capable of rendering diverse and plausible novel views. To achieve this, our method makes use of existing 2D diffusion backbones but, crucially, incorporates geometry priors in the form of a 3D feature volume. This latent feature field captures the distribution over possible scene representations and improves our method's ability to generate view-consistent novel renderings. In addition to generating novel views, our method has the ability to autoregressively synthesize 3D-consistent sequences. We demonstrate state-of-the-art results on synthetic renderings and room-scale scenes; we also show compelling results for challenging, real-world objects.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes