CVAILGNov 27, 2025

3D-Consistent Multi-View Editing by Diffusion Guidance

arXiv:2511.22228v1
Originality Incremental advance
AI Analysis

This addresses the inconsistency issue in multi-view editing for 3D scene representations, which is incremental as it builds on existing diffusion-based editing methods.

The paper tackles the problem of geometric and photometric inconsistencies in multi-view image editing for 3D representations like NeRFs or Gaussian Splat models, proposing a training-free diffusion framework that enforces consistency through a loss function, resulting in significantly improved 3D consistency and high-quality edits with sharp details and strong fidelity to text prompts.

Recent advancements in diffusion models have greatly improved text-based image editing, yet methods that edit images independently often produce geometrically and photometrically inconsistent results across different views of the same scene. Such inconsistencies are particularly problematic for editing of 3D representations such as NeRFs or Gaussian Splat models. We propose a training-free diffusion framework that enforces multi-view consistency during the image editing process. The key assumption is that corresponding points in the unedited images should undergo similar transformations after editing. To achieve this, we introduce a consistency loss that guides the diffusion sampling toward coherent edits. The framework is flexible and can be combined with widely varying image editing methods, supporting both dense and sparse multi-view editing setups. Experimental results show that our approach significantly improves 3D consistency compared to existing multi-view editing methods. We also show that this increased consistency enables high-quality Gaussian Splat editing with sharp details and strong fidelity to user-specified text prompts. Please refer to our project page for video results: https://3d-consistent-editing.github.io/

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes