CVDec 12, 2022

RGBD2: Generative Scene Synthesis via Incremental View Inpainting using RGBD Diffusion Models

arXiv:2212.05993v253 citationsh-index: 55
Originality Incremental advance
AI Analysis

This addresses the challenge of multi-view inconsistency in 3D scene synthesis from sparse inputs, but it is incremental as it builds on existing RGBD diffusion models with modifications.

The paper tackles the problem of reconstructing 3D scene geometry and colors from sparse RGBD views by introducing RGBD2, which sequentially generates novel RGBD views via inpainting and fuses them into a mesh, achieving superior performance on the ScanNet dataset.

We address the challenge of recovering an underlying scene geometry and colors from a sparse set of RGBD view observations. In this work, we present a new solution termed RGBD$^2$ that sequentially generates novel RGBD views along a camera trajectory, and the scene geometry is simply the fusion result of these views. More specifically, we maintain an intermediate surface mesh used for rendering new RGBD views, which subsequently becomes complete by an inpainting network; each rendered RGBD view is later back-projected as a partial surface and is supplemented into the intermediate mesh. The use of intermediate mesh and camera projection helps solve the tough problem of multi-view inconsistency. We practically implement the RGBD inpainting network as a versatile RGBD diffusion model, which is previously used for 2D generative modeling; we make a modification to its reverse diffusion process to enable our use. We evaluate our approach on the task of 3D scene synthesis from sparse RGBD inputs; extensive experiments on the ScanNet dataset demonstrate the superiority of our approach over existing ones. Project page: https://jblei.site/proj/rgbd-diffusion.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes