CVJul 7, 2025

LoomNet: Enhancing Multi-View Image Generation via Latent Space Weaving

arXiv:2507.05499v1h-index: 28
Originality Incremental advance
AI Analysis

This addresses the challenge of spatial consistency in multi-view image generation for 3D reconstruction, with incremental improvements over existing methods.

The paper tackles the problem of generating consistent multi-view images from a single image, which often degrades 3D mesh quality in surface reconstruction, and proposes LoomNet, a multi-view diffusion architecture that produces coherent images by applying the same diffusion model multiple times in parallel to build a shared latent space, resulting in 16 high-quality views in 15 seconds and outperforming state-of-the-art methods on image quality and reconstruction metrics.

Generating consistent multi-view images from a single image remains challenging. Lack of spatial consistency often degrades 3D mesh quality in surface reconstruction. To address this, we propose LoomNet, a novel multi-view diffusion architecture that produces coherent images by applying the same diffusion model multiple times in parallel to collaboratively build and leverage a shared latent space for view consistency. Each viewpoint-specific inference generates an encoding representing its own hypothesis of the novel view from a given camera pose, which is projected onto three orthogonal planes. For each plane, encodings from all views are fused into a single aggregated plane. These aggregated planes are then processed to propagate information and interpolate missing regions, combining the hypotheses into a unified, coherent interpretation. The final latent space is then used to render consistent multi-view images. LoomNet generates 16 high-quality and coherent views in just 15 seconds. In our experiments, LoomNet outperforms state-of-the-art methods on both image quality and reconstruction metrics, also showing creativity by producing diverse, plausible novel views from the same input.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes