CVNov 26, 2024

Distractor-free Generalizable 3D Gaussian Splatting

arXiv:2411.17605v29 citationsh-index: 14Has Code
Originality Incremental advance
AI Analysis

It addresses a previously unexplored challenge in 3D reconstruction for scenes with distractors, though it appears incremental as it builds on generalizable 3DGS.

The paper tackles the problem of distractor data causing 3D inconsistency and training instability in generalizable 3D Gaussian Splatting, achieving superior mask prediction accuracy compared to existing scene-specific methods.

We present DGGS, a novel framework that addresses the previously unexplored challenge: $\textbf{Distractor-free Generalizable 3D Gaussian Splatting}$ (3DGS). It mitigates 3D inconsistency and training instability caused by distractor data in the cross-scenes generalizable train setting while enabling feedforward inference for 3DGS and distractor masks from references in the unseen scenes. To achieve these objectives, DGGS proposes a scene-agnostic reference-based mask prediction and refinement module during the training phase, effectively eliminating the impact of distractor on training stability. Moreover, we combat distractor-induced artifacts and holes at inference time through a novel two-stage inference framework for references scoring and re-selection, complemented by a distractor pruning mechanism that further removes residual distractor 3DGS-primitive influences. Extensive feedforward experiments on the real and our synthetic data show DGGS's reconstruction capability when dealing with novel distractor scenes. Moreover, our generalizable mask prediction even achieves an accuracy superior to existing scene-specific training methods. Homepage is https://github.com/bbbbby-99/DGGS.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes