CV AI GRApr 5, 2023

Generative Novel View Synthesis with 3D-Aware Diffusion Models

Eric R. Chan, Koki Nagano, Matthew A. Chan, Alexander W. Bergman, Jeong Joon Park, Axel Levy, Miika Aittala, Shalini De Mello, Tero Karras, Gordon Wetzstein

NVIDIA

arXiv:2304.02602v140.2322 citationsh-index: 76

Originality Highly original

AI Analysis

This addresses the challenge of 3D-aware novel view synthesis for computer vision applications, representing a novel method rather than an incremental improvement.

The paper tackles the problem of generating novel 3D views from a single image using a diffusion-based model that incorporates a 3D feature volume as a geometry prior, achieving state-of-the-art results on synthetic renderings and real-world objects.

We present a diffusion-based model for 3D-aware generative novel view synthesis from as few as a single input image. Our model samples from the distribution of possible renderings consistent with the input and, even in the presence of ambiguity, is capable of rendering diverse and plausible novel views. To achieve this, our method makes use of existing 2D diffusion backbones but, crucially, incorporates geometry priors in the form of a 3D feature volume. This latent feature field captures the distribution over possible scene representations and improves our method's ability to generate view-consistent novel renderings. In addition to generating novel views, our method has the ability to autoregressively synthesize 3D-consistent sequences. We demonstrate state-of-the-art results on synthetic renderings and room-scale scenes; we also show compelling results for challenging, real-world objects.

View on arXiv PDF

Similar