CVAIDec 30, 2024

KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences

arXiv:2412.20767v1h-index: 1
Originality Incremental advance
AI Analysis

This addresses the inefficiency in 3D reconstruction for computer vision applications, though it is incremental as it builds on existing 3DGS techniques.

The paper tackles the problem of slow training times in 3D Gaussian Splatting for monocular image sequences by proposing a keyframe-centric method that uses SfM for rough poses and refines them with 3DGS, reducing training time from hours to minutes while improving novel view synthesis and pose estimation accuracy.

Reconstructing high-quality 3D models from sparse 2D images has garnered significant attention in computer vision. Recently, 3D Gaussian Splatting (3DGS) has gained prominence due to its explicit representation with efficient training speed and real-time rendering capabilities. However, existing methods still heavily depend on accurate camera poses for reconstruction. Although some recent approaches attempt to train 3DGS models without the Structure-from-Motion (SfM) preprocessing from monocular video datasets, these methods suffer from prolonged training times, making them impractical for many applications. In this paper, we present an efficient framework that operates without any depth or matching model. Our approach initially uses SfM to quickly obtain rough camera poses within seconds, and then refines these poses by leveraging the dense representation in 3DGS. This framework effectively addresses the issue of long training times. Additionally, we integrate the densification process with joint refinement and propose a coarse-to-fine frequency-aware densification to reconstruct different levels of details. This approach prevents camera pose estimation from being trapped in local minima or drifting due to high-frequency signals. Our method significantly reduces training time from hours to minutes while achieving more accurate novel view synthesis and camera pose estimation compared to previous methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes