CVDec 20, 2023

Splatter Image: Ultra-Fast Single-View 3D Reconstruction

arXiv:2312.13150v2389 citationsh-index: 104CVPR
Originality Highly original
AI Analysis

This addresses the problem of fast and high-quality 3D reconstruction from single images for applications in computer vision and graphics, with incremental improvements in speed and performance.

The paper tackles monocular 3D object reconstruction by introducing Splatter Image, an ultra-efficient method that achieves reconstruction at 38 FPS and better results in metrics like PSNR and LPIPS compared to prior works.

We introduce the \method, an ultra-efficient approach for monocular 3D object reconstruction. Splatter Image is based on Gaussian Splatting, which allows fast and high-quality reconstruction of 3D scenes from multiple images. We apply Gaussian Splatting to monocular reconstruction by learning a neural network that, at test time, performs reconstruction in a feed-forward manner, at 38 FPS. Our main innovation is the surprisingly straightforward design of this network, which, using 2D operators, maps the input image to one 3D Gaussian per pixel. The resulting set of Gaussians thus has the form an image, the Splatter Image. We further extend the method take several images as input via cross-view attention. Owning to the speed of the renderer (588 FPS), we use a single GPU for training while generating entire images at each iteration to optimize perceptual metrics like LPIPS. On several synthetic, real, multi-category and large-scale benchmark datasets, we achieve better results in terms of PSNR, LPIPS, and other metrics while training and evaluating much faster than prior works. Code, models, demo and more results are available at https://szymanowiczs.github.io/splatter-image.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes