CVAIIVAug 20, 2024

Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds

arXiv:2408.10543v13 citationsh-index: 5
Originality Incremental advance
AI Analysis

This addresses compression efficiency and quality for 3D point clouds, which is incremental as it applies diffusion models to a specific domain.

The paper tackles 3D point cloud compression by proposing Diff-PCC, a diffusion-based method that achieves state-of-the-art performance with 7.711 dB BD-PSNR gains against the G-PCC standard at ultra-low bitrate and superior subjective quality.

Stable diffusion networks have emerged as a groundbreaking development for their ability to produce realistic and detailed visual content. This characteristic renders them ideal decoders, capable of producing high-quality and aesthetically pleasing reconstructions. In this paper, we introduce the first diffusion-based point cloud compression method, dubbed Diff-PCC, to leverage the expressive power of the diffusion model for generative and aesthetically superior decoding. Different from the conventional autoencoder fashion, a dual-space latent representation is devised in this paper, in which a compressor composed of two independent encoding backbones is considered to extract expressive shape latents from distinct latent spaces. At the decoding side, a diffusion-based generator is devised to produce high-quality reconstructions by considering the shape latents as guidance to stochastically denoise the noisy point clouds. Experiments demonstrate that the proposed Diff-PCC achieves state-of-the-art compression performance (e.g., 7.711 dB BD-PSNR gains against the latest G-PCC standard at ultra-low bitrate) while attaining superior subjective quality. Source code will be made publicly available.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes