CV AI IVAug 20, 2024

Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds

arXiv:2408.10543v15.23 citationsh-index: 5

Originality Incremental advance

AI Analysis

This addresses compression efficiency and quality for 3D point clouds, which is incremental as it applies diffusion models to a specific domain.

The paper tackles 3D point cloud compression by proposing Diff-PCC, a diffusion-based method that achieves state-of-the-art performance with 7.711 dB BD-PSNR gains against the G-PCC standard at ultra-low bitrate and superior subjective quality.

Stable diffusion networks have emerged as a groundbreaking development for their ability to produce realistic and detailed visual content. This characteristic renders them ideal decoders, capable of producing high-quality and aesthetically pleasing reconstructions. In this paper, we introduce the first diffusion-based point cloud compression method, dubbed Diff-PCC, to leverage the expressive power of the diffusion model for generative and aesthetically superior decoding. Different from the conventional autoencoder fashion, a dual-space latent representation is devised in this paper, in which a compressor composed of two independent encoding backbones is considered to extract expressive shape latents from distinct latent spaces. At the decoding side, a diffusion-based generator is devised to produce high-quality reconstructions by considering the shape latents as guidance to stochastically denoise the noisy point clouds. Experiments demonstrate that the proposed Diff-PCC achieves state-of-the-art compression performance (e.g., 7.711 dB BD-PSNR gains against the latest G-PCC standard at ultra-low bitrate) while attaining superior subjective quality. Source code will be made publicly available.

View on arXiv PDF

Similar