CVAILGROAug 26, 2017

3D Object Reconstruction from a Single Depth View with Adversarial Learning

arXiv:1708.07969v1207 citationsHas Code
Originality Highly original
AI Analysis

This addresses the challenge of 3D reconstruction for robotics or AR/VR applications with minimal input, though it is incremental as it builds on GAN and autoencoder techniques.

The paper tackles the problem of reconstructing complete 3D objects from a single depth view, achieving state-of-the-art performance on large synthetic datasets with significant improvements over existing methods.

In this paper, we propose a novel 3D-RecGAN approach, which reconstructs the complete 3D structure of a given object from a single arbitrary depth view using generative adversarial networks. Unlike the existing work which typically requires multiple views of the same object or class labels to recover the full 3D geometry, the proposed 3D-RecGAN only takes the voxel grid representation of a depth view of the object as input, and is able to generate the complete 3D occupancy grid by filling in the occluded/missing regions. The key idea is to combine the generative capabilities of autoencoders and the conditional Generative Adversarial Networks (GAN) framework, to infer accurate and fine-grained 3D structures of objects in high-dimensional voxel space. Extensive experiments on large synthetic datasets show that the proposed 3D-RecGAN significantly outperforms the state of the art in single view 3D object reconstruction, and is able to reconstruct unseen types of objects. Our code and data are available at: https://github.com/Yang7879/3D-RecGAN.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes