CVAug 23, 2024

Atlas Gaussians Diffusion for 3D Generation

arXiv:2408.13055v316 citationsh-index: 53
Originality Incremental advance
AI Analysis

This addresses the challenge of linking latent and 3D spaces for 3D generation, offering improvements in detail and efficiency, but it appears incremental as it builds on existing latent diffusion and 3D Gaussian techniques.

The paper tackles the problem of high-fidelity and efficient 3D generation by introducing Atlas Gaussians, a novel representation that uses local patches to decode 3D Gaussians, and experiments show it outperforms prior feed-forward native 3D generation methods.

Using the latent diffusion model has proven effective in developing novel 3D generation techniques. To harness the latent diffusion model, a key challenge is designing a high-fidelity and efficient representation that links the latent space and the 3D space. In this paper, we introduce Atlas Gaussians, a novel representation for feed-forward native 3D generation. Atlas Gaussians represent a shape as the union of local patches, and each patch can decode 3D Gaussians. We parameterize a patch as a sequence of feature vectors and design a learnable function to decode 3D Gaussians from the feature vectors. In this process, we incorporate UV-based sampling, enabling the generation of a sufficiently large, and theoretically infinite, number of 3D Gaussian points. The large amount of 3D Gaussians enables the generation of high-quality details. Moreover, due to local awareness of the representation, the transformer-based decoding procedure operates on a patch level, ensuring efficiency. We train a variational autoencoder to learn the Atlas Gaussians representation, and then apply a latent diffusion model on its latent space for learning 3D Generation. Experiments show that our approach outperforms the prior arts of feed-forward native 3D generation. Project page: https://yanghtr.github.io/projects/atlas_gaussians.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes