GRCVMar 18, 2024

View-Consistent 3D Editing with Gaussian Splatting

arXiv:2403.11868v1079 citationsh-index: 34ECCV
Originality Incremental advance
AI Analysis

This addresses a critical bottleneck in 3D editing for computer vision and graphics applications, offering an incremental improvement over existing methods.

The paper tackles the problem of multi-view inconsistency in 3D Gaussian Splatting editing, which causes mode collapse and artifacts, and introduces VcEdit to ensure consistency and improve editing quality.

The advent of 3D Gaussian Splatting (3DGS) has revolutionized 3D editing, offering efficient, high-fidelity rendering and enabling precise local manipulations. Currently, diffusion-based 2D editing models are harnessed to modify multi-view rendered images, which then guide the editing of 3DGS models. However, this approach faces a critical issue of multi-view inconsistency, where the guidance images exhibit significant discrepancies across views, leading to mode collapse and visual artifacts of 3DGS. To this end, we introduce View-consistent Editing (VcEdit), a novel framework that seamlessly incorporates 3DGS into image editing processes, ensuring multi-view consistency in edited guidance images and effectively mitigating mode collapse issues. VcEdit employs two innovative consistency modules: the Cross-attention Consistency Module and the Editing Consistency Module, both designed to reduce inconsistencies in edited images. By incorporating these consistency modules into an iterative pattern, VcEdit proficiently resolves the issue of multi-view inconsistency, facilitating high-quality 3DGS editing across a diverse range of scenes. Further video results are shown in http://vcedit.github.io.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes