CVLGOct 20, 2021

Style Agnostic 3D Reconstruction via Adversarial Style Transfer

arXiv:2110.10784v15 citations
Originality Incremental advance
AI Analysis

This addresses a challenge in computer vision for applications requiring 3D modeling from real-world images, though it is incremental as it builds on differentiable renderers.

The paper tackles the problem of 3D object reconstruction from single-view images with backgrounds by proposing an adversarial style-transfer and domain adaptation pipeline, eliminating the need for silhouette supervision and achieving better performance than constrained methods.

Reconstructing the 3D geometry of an object from an image is a major challenge in computer vision. Recently introduced differentiable renderers can be leveraged to learn the 3D geometry of objects from 2D images, but those approaches require additional supervision to enable the renderer to produce an output that can be compared to the input image. This can be scene information or constraints such as object silhouettes, uniform backgrounds, material, texture, and lighting. In this paper, we propose an approach that enables a differentiable rendering-based learning of 3D objects from images with backgrounds without the need for silhouette supervision. Instead of trying to render an image close to the input, we propose an adversarial style-transfer and domain adaptation pipeline that allows to translate the input image domain to the rendered image domain. This allows us to directly compare between a translated image and the differentiable rendering of a 3D object reconstruction in order to train the 3D object reconstruction network. We show that the approach learns 3D geometry from images with backgrounds and provides a better performance than constrained methods for single-view 3D object reconstruction on this task.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes