CV AI LGMar 18, 2023

3DQD: Generalized Deep 3D Shape Prior via Part-Discretized Diffusion Process

Yuhan Li, Yishun Dou, Xuanhong Chen, Bingbing Ni, Yilin Sun, Yutian Liu, Fuzhen Wang

arXiv:2303.10406v116.440 citationsh-index: 60Has Code

Originality Highly original

AI Analysis

It addresses 3D shape generation for computer vision and graphics applications, representing an incremental improvement with hybrid methods.

The paper tackles 3D shape generation for tasks like unconditional generation and point cloud completion by developing a prior model that uses a VQ-VAE for local geometry and a discrete diffusion generator for structural dependencies, achieving superior performance in experiments.

We develop a generalized 3D shape generation prior model, tailored for multiple 3D tasks including unconditional shape generation, point cloud completion, and cross-modality shape generation, etc. On one hand, to precisely capture local fine detailed shape information, a vector quantized variational autoencoder (VQ-VAE) is utilized to index local geometry from a compactly learned codebook based on a broad set of task training data. On the other hand, a discrete diffusion generator is introduced to model the inherent structural dependencies among different tokens. In the meantime, a multi-frequency fusion module (MFM) is developed to suppress high-frequency shape feature fluctuations, guided by multi-frequency contextual information. The above designs jointly equip our proposed 3D shape prior model with high-fidelity, diverse features as well as the capability of cross-modality alignment, and extensive experiments have demonstrated superior performances on various 3D shape generation tasks.

View on arXiv PDF Code

Similar