CVJun 30, 2025

WAVE: Warp-Based View Guidance for Consistent Novel View Synthesis Using a Single Image

arXiv:2506.23518v21 citationsh-index: 7
Originality Incremental advance
AI Analysis

This addresses the challenge of generating consistent novel views from a single image for applications in computer vision and graphics, representing an incremental improvement over existing diffusion-based methods.

The paper tackles the problem of maintaining structural coherence (view consistency) in novel view synthesis from a single image, proposing a training-free method that enhances diffusion models with adaptive attention manipulation and noise reinitialization using view-guided warping, resulting in improved view consistency across various diffusion models as demonstrated through a comprehensive metric framework.

Generating high-quality novel views of a scene from a single image requires maintaining structural coherence across different views, referred to as view consistency. While diffusion models have driven advancements in novel view synthesis, they still struggle to preserve spatial continuity across views. Diffusion models have been combined with 3D models to address the issue, but such approaches lack efficiency due to their complex multi-step pipelines. This paper proposes a novel view-consistent image generation method which utilizes diffusion models without additional modules. Our key idea is to enhance diffusion models with a training-free method that enables adaptive attention manipulation and noise reinitialization by leveraging view-guided warping to ensure view consistency. Through our comprehensive metric framework suitable for novel-view datasets, we show that our method improves view consistency across various diffusion models, demonstrating its broader applicability.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes