CVApr 14

SyncFix: Fixing 3D Reconstructions via Multi-View Synchronization

arXiv:2604.1179789.41 citationsh-index: 16
Predicted impact top 17% in CV · last 90 daysOriginality Incremental advance
AI Analysis

For 3D reconstruction practitioners, SyncFix addresses the bottleneck of semantic and geometric inconsistencies across views, offering a practical refinement method that outperforms existing baselines.

SyncFix enforces cross-view consistency during diffusion-based refinement of 3D reconstructions, achieving state-of-the-art quality even without clean reference images. It generalizes from image pairs to arbitrary views, with quality improving as views increase.

We present SyncFix, a framework that enforces cross-view consistency during the diffusion-based refinement of reconstructed scenes. SyncFix formulates refinement as a joint latent bridge matching problem, synchronizing distorted and clean representations across multiple views to fix the semantic and geometric inconsistencies. This means SyncFix learns a joint conditional over multiple views to enforce consistency throughout the denoising trajectory. Our training is done only on image pairs, but it generalizes naturally to an arbitrary number of views during inference. Moreover, reconstruction quality improves with additional views, with diminishing returns at higher view counts. Qualitative and quantitative results demonstrate that SyncFix consistently generates high-quality reconstructions and surpasses current state-of-the-art baselines, even in the absence of clean reference images. SyncFix achieves even higher fidelity when sparse references are available.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes