CVIVAug 3, 2025

Rate-distortion Optimized Point Cloud Preprocessing for Geometry-based Point Cloud Compression

arXiv:2508.01633v1h-index: 17IEEE Transactions on Image Processing
Originality Highly original
AI Analysis

This work enhances legacy compression standards for point cloud applications by improving efficiency without sacrificing interoperability, making it practical for real-world deployment.

The paper tackles the underperformance of the standard geometry-based point cloud compression (G-PCC) compared to deep learning methods by proposing a preprocessing framework that integrates a voxelization network with a differentiable surrogate model, achieving a 38.84% average BD-rate reduction over G-PCC.

Geometry-based point cloud compression (G-PCC), an international standard designed by MPEG, provides a generic framework for compressing diverse types of point clouds while ensuring interoperability across applications and devices. However, G-PCC underperforms compared to recent deep learning-based PCC methods despite its lower computational power consumption. To enhance the efficiency of G-PCC without sacrificing its interoperability or computational flexibility, we propose a novel preprocessing framework that integrates a compression-oriented voxelization network with a differentiable G-PCC surrogate model, jointly optimized in the training phase. The surrogate model mimics the rate-distortion behaviour of the non-differentiable G-PCC codec, enabling end-to-end gradient propagation. The versatile voxelization network adaptively transforms input point clouds using learning-based voxelization and effectively manipulates point clouds via global scaling, fine-grained pruning, and point-level editing for rate-distortion trade-offs. During inference, only the lightweight voxelization network is appended to the G-PCC encoder, requiring no modifications to the decoder, thus introducing no computational overhead for end users. Extensive experiments demonstrate a 38.84% average BD-rate reduction over G-PCC. By bridging classical codecs with deep learning, this work offers a practical pathway to enhance legacy compression standards while preserving their backward compatibility, making it ideal for real-world deployment.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes