CV AIDec 28, 2018

Learning to Reconstruct Shapes from Unseen Classes

Xiuming Zhang, Zhoutong Zhang, Chengkai Zhang, Joshua B. Tenenbaum, William T. Freeman, Jiajun Wu

arXiv:1812.11166v127.5163 citations

Originality Highly original

AI Analysis

This addresses the issue of poor generalization in 3D reconstruction for computer vision applications, though it appears incremental as it builds on existing methods with a hybrid approach.

The paper tackles the problem of single-image 3D reconstruction algorithms being biased by training classes, presenting GenRe to capture class-agnostic shape priors, resulting in good performance and generalization to novel object categories not seen during training.

From a single image, humans are able to perceive the full 3D shape of an object by exploiting learned shape priors from everyday life. Contemporary single-image 3D reconstruction algorithms aim to solve this task in a similar fashion, but often end up with priors that are highly biased by training classes. Here we present an algorithm, Generalizable Reconstruction (GenRe), designed to capture more generic, class-agnostic shape priors. We achieve this with an inference network and training procedure that combine 2.5D representations of visible surfaces (depth and silhouette), spherical shape representations of both visible and non-visible surfaces, and 3D voxel-based representations, in a principled manner that exploits the causal structure of how 3D shapes give rise to 2D images. Experiments demonstrate that GenRe performs well on single-view shape reconstruction, and generalizes to diverse novel objects from categories not seen during training.

View on arXiv PDF

Similar