CVMMNov 11, 2024

A Hierarchical Compression Technique for 3D Gaussian Splatting Compression

arXiv:2411.06976v220 citationsh-index: 17ICASSP
Originality Incremental advance
AI Analysis

This addresses storage and transmission challenges for 3D GS applications, but it appears incremental as it focuses on compressing GS data itself rather than developing new scene representations.

The paper tackles the problem of large data size in 3D Gaussian Splatting (GS) for novel view synthesis, which hinders storage and transmission, by proposing a Hierarchical GS Compression (HGSC) technique that achieves over 4.5 times data size reduction compared to state-of-the-art methods on small scenes datasets while maintaining visual quality.

3D Gaussian Splatting (GS) demonstrates excellent rendering quality and generation speed in novel view synthesis. However, substantial data size poses challenges for storage and transmission, making 3D GS compression an essential technology. Current 3D GS compression research primarily focuses on developing more compact scene representations, such as converting explicit 3D GS data into implicit forms. In contrast, compression of the GS data itself has hardly been explored. To address this gap, we propose a Hierarchical GS Compression (HGSC) technique. Initially, we prune unimportant Gaussians based on importance scores derived from both global and local significance, effectively reducing redundancy while maintaining visual quality. An Octree structure is used to compress 3D positions. Based on the 3D GS Octree, we implement a hierarchical attribute compression strategy by employing a KD-tree to partition the 3D GS into multiple blocks. We apply farthest point sampling to select anchor primitives within each block and others as non-anchor primitives with varying Levels of Details (LoDs). Anchor primitives serve as reference points for predicting non-anchor primitives across different LoDs to reduce spatial redundancy. For anchor primitives, we use the region adaptive hierarchical transform to achieve near-lossless compression of various attributes. For non-anchor primitives, each is predicted based on the k-nearest anchor primitives. To further minimize prediction errors, the reconstructed LoD and anchor primitives are combined to form new anchor primitives to predict the next LoD. Our method notably achieves superior compression quality and a significant data size reduction of over 4.5 times compared to the state-of-the-art compression method on small scenes datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes