CVMay 22, 2025

Motion Matters: Compact Gaussian Streaming for Free-Viewpoint Video Reconstruction

arXiv:2505.16533v2h-index: 14
Originality Incremental advance
AI Analysis

This work addresses storage efficiency for dynamic scene reconstruction in free-viewpoint video applications, representing an incremental improvement over existing methods.

The paper tackles the prohibitive storage requirements in online free-viewpoint video reconstruction by proposing a Compact Gaussian Streaming framework that exploits motion properties, achieving a storage reduction of over 159x compared to 3DGStream and 14x compared to QUEEN while maintaining competitive visual fidelity and rendering speed.

3D Gaussian Splatting (3DGS) has emerged as a high-fidelity and efficient paradigm for online free-viewpoint video (FVV) reconstruction, offering viewers rapid responsiveness and immersive experiences. However, existing online methods face challenge in prohibitive storage requirements primarily due to point-wise modeling that fails to exploit the motion properties. To address this limitation, we propose a novel Compact Gaussian Streaming (ComGS) framework, leveraging the locality and consistency of motion in dynamic scene, that models object-consistent Gaussian point motion through keypoint-driven motion representation. By transmitting only the keypoint attributes, this framework provides a more storage-efficient solution. Specifically, we first identify a sparse set of motion-sensitive keypoints localized within motion regions using a viewspace gradient difference strategy. Equipped with these keypoints, we propose an adaptive motion-driven mechanism that predicts a spatial influence field for propagating keypoint motion to neighboring Gaussian points with similar motion. Moreover, ComGS adopts an error-aware correction strategy for key frame reconstruction that selectively refines erroneous regions and mitigates error accumulation without unnecessary overhead. Overall, ComGS achieves a remarkable storage reduction of over 159 X compared to 3DGStream and 14 X compared to the SOTA method QUEEN, while maintaining competitive visual fidelity and rendering speed.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes