CVIVAug 6, 2024

Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement

arXiv:2408.02966v29 citationsh-index: 11
Originality Highly original
AI Analysis

This work addresses the problem of efficient point cloud compression for applications like VR/AR and autonomous driving, representing a novel approach rather than an incremental improvement.

The paper tackles the challenge of compressing unordered 3D point clouds by proposing a method that uses KNN-based local context for adaptive arithmetic coding and a dual-layer architecture with INR-based refinement, achieving two orders of magnitude reduction in model complexity and coding latency compared to SOTA methods.

Compressing a set of unordered points is far more challenging than compressing images/videos of regular sample grids, because of the difficulties in characterizing neighboring relations in an irregular layout of points. Many researchers resort to voxelization to introduce regularity, but this approach suffers from quantization loss. In this research, we use the KNN method to determine the neighborhoods of raw surface points. This gives us a means to determine the spatial context in which the latent features of 3D points are compressed by arithmetic coding. As such, the conditional probability model is adaptive to local geometry, leading to significant rate reduction. Additionally, we propose a dual-layer architecture where a non-learning base layer reconstructs the main structures of the point cloud at low complexity, while a learned refinement layer focuses on preserving fine details. This design leads to reductions in model complexity and coding latency by two orders of magnitude compared to SOTA methods. Moreover, we incorporate an implicit neural representation (INR) into the refinement layer, allowing the decoder to sample points on the underlying surface at arbitrary densities. This work is the first to effectively exploit content-aware local contexts for compressing irregular raw point clouds, achieving high rate-distortion performance, low complexity, and the ability to function as an arbitrary-scale upsampling network simultaneously.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes