CVDec 13, 2023

Global Latent Neural Rendering

arXiv:2312.08338v210 citationsh-index: 10CVPR
Originality Incremental advance
AI Analysis

This work addresses the challenge of synthesizing novel views from sparse inputs for applications in computer vision and graphics, representing an incremental improvement over prior methods.

The paper tackles the problem of generalizable novel view synthesis by proposing a global rendering operator that processes all camera rays jointly, instead of treating pixels independently, and demonstrates that their method consistently outperforms existing approaches by significant margins.

A recent trend among generalizable novel view synthesis methods is to learn a rendering operator acting over single camera rays. This approach is promising because it removes the need for explicit volumetric rendering, but it effectively treats target images as collections of independent pixels. Here, we propose to learn a global rendering operator acting over all camera rays jointly. We show that the right representation to enable such rendering is a 5-dimensional plane sweep volume consisting of the projection of the input images on a set of planes facing the target camera. Based on this understanding, we introduce our Convolutional Global Latent Renderer (ConvGLR), an efficient convolutional architecture that performs the rendering operation globally in a low-resolution latent space. Experiments on various datasets under sparse and generalizable setups show that our approach consistently outperforms existing methods by significant margins.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes