CVAug 12, 2020

Free View Synthesis

arXiv:2008.05511v1386 citations
AI Analysis

This addresses the problem of generating realistic views from arbitrary camera positions for applications like virtual reality, with a method that is more flexible and efficient than prior work.

The paper tackles novel view synthesis from freely distributed input images, enabling free camera movement and handling general scenes without per-scene optimization, and demonstrates state-of-the-art performance on real-world datasets like Tanks and Temples.

We present a method for novel view synthesis from input images that are freely distributed around a scene. Our method does not rely on a regular arrangement of input views, can synthesize images for free camera movement through the scene, and works for general scenes with unconstrained geometric layouts. We calibrate the input images via SfM and erect a coarse geometric scaffold via MVS. This scaffold is used to create a proxy depth map for a novel view of the scene. Based on this depth map, a recurrent encoder-decoder network processes reprojected features from nearby views and synthesizes the new view. Our network does not need to be optimized for a given scene. After training on a dataset, it works in previously unseen environments with no fine-tuning or per-scene optimization. We evaluate the presented approach on challenging real-world datasets, including Tanks and Temples, where we demonstrate successful view synthesis for the first time and substantially outperform prior and concurrent work.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes