GRCVLGSep 30, 2025

GaussEdit: Adaptive 3D Scene Editing with Text and Image Prompts

arXiv:2509.26055v15 citationsh-index: 10IEEE Trans Vis Comput Graph
Originality Incremental advance
AI Analysis

This provides a powerful tool for detailed and user-driven 3D scene editing, offering significant improvements over traditional methods.

The paper tackles the problem of 3D scene editing by introducing GaussEdit, a framework that uses text and image prompts to achieve high-quality edits, resulting in superior editing accuracy, visual fidelity, and processing speed compared to existing methods.

This paper presents GaussEdit, a framework for adaptive 3D scene editing guided by text and image prompts. GaussEdit leverages 3D Gaussian Splatting as its backbone for scene representation, enabling convenient Region of Interest selection and efficient editing through a three-stage process. The first stage involves initializing the 3D Gaussians to ensure high-quality edits. The second stage employs an Adaptive Global-Local Optimization strategy to balance global scene coherence and detailed local edits and a category-guided regularization technique to alleviate the Janus problem. The final stage enhances the texture of the edited objects using a sophisticated image-to-image synthesis technique, ensuring that the results are visually realistic and align closely with the given prompts. Our experimental results demonstrate that GaussEdit surpasses existing methods in editing accuracy, visual fidelity, and processing speed. By successfully embedding user-specified concepts into 3D scenes, GaussEdit is a powerful tool for detailed and user-driven 3D scene editing, offering significant improvements over traditional methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes