CVDec 30, 2025

Structure-Guided Allocation of 2D Gaussians for Image Representation and Compression

arXiv:2512.24018v13 citationsh-index: 25
Originality Incremental advance
AI Analysis

This work addresses image compression inefficiencies for applications requiring high-speed decoding, though it is incremental as it builds on existing 2DGS methods.

The paper tackles the problem of inefficient rate-distortion performance in 2D Gaussian Splatting for image compression by proposing a structure-guided allocation principle, resulting in a 43.44% BD-rate reduction on Kodak and 29.91% on DIV2K while maintaining over 1000 FPS decoding.

Recent advances in 2D Gaussian Splatting (2DGS) have demonstrated its potential as a compact image representation with millisecond-level decoding. However, existing 2DGS-based pipelines allocate representation capacity and parameter precision largely oblivious to image structure, limiting their rate-distortion (RD) efficiency at low bitrates. To address this, we propose a structure-guided allocation principle for 2DGS, which explicitly couples image structure with both representation capacity and quantization precision, while preserving native decoding speed. First, we introduce a structure-guided initialization that assigns 2D Gaussians according to spatial structural priors inherent in natural images, yielding a localized and semantically meaningful distribution. Second, during quantization-aware fine-tuning, we propose adaptive bitwidth quantization of covariance parameters, which grants higher precision to small-scale Gaussians in complex regions and lower precision elsewhere, enabling RD-aware optimization, thereby reducing redundancy without degrading edge quality. Third, we impose a geometry-consistent regularization that aligns Gaussian orientations with local gradient directions to better preserve structural details. Extensive experiments demonstrate that our approach substantially improves both the representational power and the RD performance of 2DGS while maintaining over 1000 FPS decoding. Compared with the baseline GSImage, we reduce BD-rate by 43.44% on Kodak and 29.91% on DIV2K.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes