CVAug 5, 2023

Where and How: Mitigating Confusion in Neural Radiance Fields from Sparse Inputs

Yanqi Bao, Yuxin Li, Jing Huo, Tianyu Ding, Xinyue Liang, Wenbin Li, Yang Gao

arXiv:2308.02908v19.815 citationsh-index: 19Has Code

Originality Incremental advance

AI Analysis

This addresses rendering quality issues in 3D scene reconstruction for computer vision applications, but it is incremental as it builds on existing NeRF-S methods.

The paper tackles the problem of over-fitting and foggy surfaces in Neural Radiance Fields from sparse inputs (NeRF-S) by analyzing root causes as 'WHERE' and 'HOW' confusion, and presents WaH-NeRF, a framework that outperforms previous methods in this setting.

Neural Radiance Fields from Sparse input} (NeRF-S) have shown great potential in synthesizing novel views with a limited number of observed viewpoints. However, due to the inherent limitations of sparse inputs and the gap between non-adjacent views, rendering results often suffer from over-fitting and foggy surfaces, a phenomenon we refer to as "CONFUSION" during volume rendering. In this paper, we analyze the root cause of this confusion and attribute it to two fundamental questions: "WHERE" and "HOW". To this end, we present a novel learning framework, WaH-NeRF, which effectively mitigates confusion by tackling the following challenges: (i)"WHERE" to Sample? in NeRF-S -- we introduce a Deformable Sampling strategy and a Weight-based Mutual Information Loss to address sample-position confusion arising from the limited number of viewpoints; and (ii) "HOW" to Predict? in NeRF-S -- we propose a Semi-Supervised NeRF learning Paradigm based on pose perturbation and a Pixel-Patch Correspondence Loss to alleviate prediction confusion caused by the disparity between training and testing viewpoints. By integrating our proposed modules and loss functions, WaH-NeRF outperforms previous methods under the NeRF-S setting. Code is available https://github.com/bbbbby-99/WaH-NeRF.

View on arXiv PDF Code

Similar