CVAINov 26, 2024

MVBoost: Boost 3D Reconstruction with Multi-View Refinement

arXiv:2411.17772v21 citationsh-index: 31CVPR
Originality Incremental advance
AI Analysis

This addresses the data scarcity issue in 3D reconstruction for computer vision applications, but it is incremental as it builds on existing multi-view and 3D models.

The paper tackles the problem of limited generalization in 3D reconstruction due to scarce datasets by proposing MVBoost, a framework that generates pseudo-GT data through multi-view refinement, achieving superior reconstruction results and robust generalization compared to prior works.

Recent advancements in 3D object reconstruction have been remarkable, yet most current 3D models rely heavily on existing 3D datasets. The scarcity of diverse 3D datasets results in limited generalization capabilities of 3D reconstruction models. In this paper, we propose a novel framework for boosting 3D reconstruction with multi-view refinement (MVBoost) by generating pseudo-GT data. The key of MVBoost is combining the advantages of the high accuracy of the multi-view generation model and the consistency of the 3D reconstruction model to create a reliable data source. Specifically, given a single-view input image, we employ a multi-view diffusion model to generate multiple views, followed by a large 3D reconstruction model to produce consistent 3D data. MVBoost then adaptively refines these multi-view images, rendered from the consistent 3D data, to build a large-scale multi-view dataset for training a feed-forward 3D reconstruction model. Additionally, the input view optimization is designed to optimize the corresponding viewpoints based on the user's input image, ensuring that the most important viewpoint is accurately tailored to the user's needs. Extensive evaluations demonstrate that our method achieves superior reconstruction results and robust generalization compared to prior works.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes