CVJul 7, 2025

Mastering Regional 3DGS: Locating, Initializing, and Editing with Diverse 2D Priors

arXiv:2507.05426v1h-index: 6
Originality Incremental advance
AI Analysis

This addresses the challenge of targeted 3D scene manipulation for applications like virtual reality or content creation, though it is incremental as it builds on existing 3DGS and 2D prior methods.

The paper tackles the problem of precise regional editing in 3D Gaussian Splatting scenes by leveraging 2D diffusion and depth priors to locate, initialize, and refine edits, achieving state-of-the-art performance with up to a 4× speedup.

Many 3D scene editing tasks focus on modifying local regions rather than the entire scene, except for some global applications like style transfer, and in the context of 3D Gaussian Splatting (3DGS), where scenes are represented by a series of Gaussians, this structure allows for precise regional edits, offering enhanced control over specific areas of the scene; however, the challenge lies in the fact that 3D semantic parsing often underperforms compared to its 2D counterpart, making targeted manipulations within 3D spaces more difficult and limiting the fidelity of edits, which we address by leveraging 2D diffusion editing to accurately identify modification regions in each view, followed by inverse rendering for 3D localization, then refining the frontal view and initializing a coarse 3DGS with consistent views and approximate shapes derived from depth maps predicted by a 2D foundation model, thereby supporting an iterative, view-consistent editing process that gradually enhances structural details and textures to ensure coherence across perspectives. Experiments demonstrate that our method achieves state-of-the-art performance while delivering up to a $4\times$ speedup, providing a more efficient and effective approach to 3D scene local editing.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes