CVAug 9, 2023

WaveNeRF: Wavelet-based Generalizable Neural Radiance Fields

arXiv:2308.04826v228 citationsh-index: 120
Originality Highly original
AI Analysis

This addresses the need for efficient and high-quality 3D scene reconstruction in computer vision, offering a generalizable solution that reduces the requirement for densely sampled images.

The paper tackles the problem of poor scalability in Neural Radiance Fields (NeRF) for novel view synthesis by proposing WaveNeRF, which integrates wavelet frequency decomposition into Multi-View Stereo and NeRF to achieve generalizable synthesis without per-scene optimization, achieving superior results with only three input images on benchmarks.

Neural Radiance Field (NeRF) has shown impressive performance in novel view synthesis via implicit scene representation. However, it usually suffers from poor scalability as requiring densely sampled images for each new scene. Several studies have attempted to mitigate this problem by integrating Multi-View Stereo (MVS) technique into NeRF while they still entail a cumbersome fine-tuning process for new scenes. Notably, the rendering quality will drop severely without this fine-tuning process and the errors mainly appear around the high-frequency features. In the light of this observation, we design WaveNeRF, which integrates wavelet frequency decomposition into MVS and NeRF to achieve generalizable yet high-quality synthesis without any per-scene optimization. To preserve high-frequency information when generating 3D feature volumes, WaveNeRF builds Multi-View Stereo in the Wavelet domain by integrating the discrete wavelet transform into the classical cascade MVS, which disentangles high-frequency information explicitly. With that, disentangled frequency features can be injected into classic NeRF via a novel hybrid neural renderer to yield faithful high-frequency details, and an intuitive frequency-guided sampling strategy can be designed to suppress artifacts around high-frequency regions. Extensive experiments over three widely studied benchmarks show that WaveNeRF achieves superior generalizable radiance field modeling when only given three images as input.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes