CVFeb 27, 2018

Multi-View Silhouette and Depth Decomposition for High Resolution 3D Object Representation

arXiv:1802.09987v356 citations
Originality Incremental advance
AI Analysis

This addresses the problem of efficiently generating high-resolution 3D objects for computer vision and graphics applications, representing an incremental advance with specific gains in resolution and performance.

The paper tackles scaling deep generative shape models to high-resolution 3D objects by introducing a method that performs super-resolution on six orthographic depth projections, achieving state-of-the-art performance on 3D object reconstruction from RGB images on ShapeNet and generating objects at resolutions up to 512×512×512.

We consider the problem of scaling deep generative shape models to high-resolution. Drawing motivation from the canonical view representation of objects, we introduce a novel method for the fast up-sampling of 3D objects in voxel space through networks that perform super-resolution on the six orthographic depth projections. This allows us to generate high-resolution objects with more efficient scaling than methods which work directly in 3D. We decompose the problem of 2D depth super-resolution into silhouette and depth prediction to capture both structure and fine detail. This allows our method to generate sharp edges more easily than an individual network. We evaluate our work on multiple experiments concerning high-resolution 3D objects, and show our system is capable of accurately predicting novel objects at resolutions as large as 512$\mathbf{\times}$512$\mathbf{\times}$512 -- the highest resolution reported for this task. We achieve state-of-the-art performance on 3D object reconstruction from RGB images on the ShapeNet dataset, and further demonstrate the first effective 3D super-resolution method.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes