CVMar 26, 2023

BlobGAN-3D: A Spatially-Disentangled 3D-Aware Generative Model for Indoor Scenes

arXiv:2303.14706v19 citationsh-index: 70
Originality Incremental advance
AI Analysis

This work addresses the problem of multi-object scene editing for researchers and practitioners in 3D-aware generative modeling, though it is incremental as it builds directly on the 2D BlobGAN framework.

The paper tackled the challenge of realistic object-level editing in multi-object 3D-aware image synthesis by proposing BlobGAN-3D, a 3D-aware generative model that extends 2D BlobGAN into 3D blobs, achieving comparable image quality to baselines while enabling camera pose control and object-level editing in indoor scenes.

3D-aware image synthesis has attracted increasing interest as it models the 3D nature of our real world. However, performing realistic object-level editing of the generated images in the multi-object scenario still remains a challenge. Recently, a 2D GAN termed BlobGAN has demonstrated great multi-object editing capabilities on real-world indoor scene datasets. In this work, we propose BlobGAN-3D, which is a 3D-aware improvement of the original 2D BlobGAN. We enable explicit camera pose control while maintaining the disentanglement for individual objects in the scene by extending the 2D blobs into 3D blobs. We keep the object-level editing capabilities of BlobGAN and in addition allow flexible control over the 3D location of the objects in the scene. We test our method on real-world indoor datasets and show that our method can achieve comparable image quality compared to the 2D BlobGAN and other 3D-aware GAN baselines while being able to enable camera pose control and object-level editing in the challenging multi-object real-world scenarios.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes