CVJul 20, 2022

Monocular 3D Object Reconstruction with GAN Inversion

arXiv:2207.10061v116 citationsh-index: 128Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of reconstructing textured 3D meshes from single images for in-the-wild objects, with incremental improvements in generalization to deformable objects.

The paper tackles monocular 3D object reconstruction by using a pre-trained 3D GAN's generative prior to regularize mesh geometry and texture, achieving faithful reconstructions with consistent results across observed and unobserved parts.

Recovering a textured 3D mesh from a monocular image is highly challenging, particularly for in-the-wild objects that lack 3D ground truths. In this work, we present MeshInversion, a novel framework to improve the reconstruction by exploiting the generative prior of a 3D GAN pre-trained for 3D textured mesh synthesis. Reconstruction is achieved by searching for a latent space in the 3D GAN that best resembles the target mesh in accordance with the single view observation. Since the pre-trained GAN encapsulates rich 3D semantics in terms of mesh geometry and texture, searching within the GAN manifold thus naturally regularizes the realness and fidelity of the reconstruction. Importantly, such regularization is directly applied in the 3D space, providing crucial guidance of mesh parts that are unobserved in the 2D space. Experiments on standard benchmarks show that our framework obtains faithful 3D reconstructions with consistent geometry and texture across both observed and unobserved parts. Moreover, it generalizes well to meshes that are less commonly seen, such as the extended articulation of deformable objects. Code is released at https://github.com/junzhezhang/mesh-inversion

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes