CVApr 6, 2019

Deep Surface Normal Estimation with Hierarchical RGB-D Fusion

arXiv:1904.03405v277 citations
Originality Incremental advance
AI Analysis

This work addresses a fundamental scene understanding problem for applications using commodity RGB-D cameras, representing an incremental improvement over existing methods.

The paper tackles surface normal estimation from a single RGB-D image by proposing a hierarchical fusion network with adaptive feature re-weighting and a hybrid multi-scale loss function, achieving state-of-the-art performance in experiments.

The growing availability of commodity RGB-D cameras has boosted the applications in the field of scene understanding. However, as a fundamental scene understanding task, surface normal estimation from RGB-D data lacks thorough investigation. In this paper, a hierarchical fusion network with adaptive feature re-weighting is proposed for surface normal estimation from a single RGB-D image. Specifically, the features from color image and depth are successively integrated at multiple scales to ensure global surface smoothness while preserving visually salient details. Meanwhile, the depth features are re-weighted with a confidence map estimated from depth before merging into the color branch to avoid artifacts caused by input depth corruption. Additionally, a hybrid multi-scale loss function is designed to learn accurate normal estimation given noisy ground-truth dataset. Extensive experimental results validate the effectiveness of the fusion strategy and the loss design, outperforming state-of-the-art normal estimation schemes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes