CVDec 6, 2024

Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction

arXiv:2412.04887v219 citationsh-index: 2
AI Analysis

This work addresses efficiency and accuracy challenges in 3D reconstruction for applications like virtual reality and robotics, representing an incremental improvement over existing hybrid methods.

The paper tackles the problem of high memory consumption and storage overhead in large-scale 3D scene reconstruction using Gaussian Splatting by proposing Momentum-GS, which uses momentum-based self-distillation and block weighting to improve consistency and accuracy, achieving a 12.8% improvement in LPIPS over CityGaussian with fewer divided blocks.

3D Gaussian Splatting has demonstrated notable success in large-scale scene reconstruction, but challenges persist due to high training memory consumption and storage overhead. Hybrid representations that integrate implicit and explicit features offer a way to mitigate these limitations. However, when applied in parallelized block-wise training, two critical issues arise since reconstruction accuracy deteriorates due to reduced data diversity when training each block independently, and parallel training restricts the number of divided blocks to the available number of GPUs. To address these issues, we propose Momentum-GS, a novel approach that leverages momentum-based self-distillation to promote consistency and accuracy across the blocks while decoupling the number of blocks from the physical GPU count. Our method maintains a teacher Gaussian decoder updated with momentum, ensuring a stable reference during training. This teacher provides each block with global guidance in a self-distillation manner, promoting spatial consistency in reconstruction. To further ensure consistency across the blocks, we incorporate block weighting, dynamically adjusting each block's weight according to its reconstruction accuracy. Extensive experiments on large-scale scenes show that our method consistently outperforms existing techniques, achieving a 12.8% improvement in LPIPS over CityGaussian with much fewer divided blocks and establishing a new state of the art. Project page: https://jixuan-fan.github.io/Momentum-GS_Page/

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes