CVROMar 17, 2024

Compact 3D Gaussian Splatting For Dense Visual SLAM

arXiv:2403.11247v256 citationsh-index: 11
AI Analysis

This work addresses memory and speed limitations in dense visual SLAM for robotics and AR/VR applications, representing an incremental improvement over existing methods.

The paper tackles the high memory and slow training of 3D Gaussian-based SLAM by proposing a compact system that reduces redundant ellipsoids and compresses geometric attributes, achieving faster training and rendering while maintaining state-of-the-art quality.

Recent work has shown that 3D Gaussian-based SLAM enables high-quality reconstruction, accurate pose estimation, and real-time rendering of scenes. However, these approaches are built on a tremendous number of redundant 3D Gaussian ellipsoids, leading to high memory and storage costs, and slow training speed. To address the limitation, we propose a compact 3D Gaussian Splatting SLAM system that reduces the number and the parameter size of Gaussian ellipsoids. A sliding window-based masking strategy is first proposed to reduce the redundant ellipsoids. Then we observe that the covariance matrix (geometry) of most 3D Gaussian ellipsoids are extremely similar, which motivates a novel geometry codebook to compress 3D Gaussian geometric attributes, i.e., the parameters. Robust and accurate pose estimation is achieved by a global bundle adjustment method with reprojection loss. Extensive experiments demonstrate that our method achieves faster training and rendering speed while maintaining the state-of-the-art (SOTA) quality of the scene representation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes