CVApr 6, 2019

Deep Surface Normal Estimation with Hierarchical RGB-D Fusion

Jin Zeng, Yanfeng Tong, Yunmu Huang, Qiong Yan, Wenxiu Sun, Jing Chen, Yongtian Wang

arXiv:1904.03405v217.577 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses a fundamental scene understanding problem for applications using commodity RGB-D cameras, representing an incremental improvement over existing methods.

The paper tackles surface normal estimation from a single RGB-D image by proposing a hierarchical fusion network with adaptive feature re-weighting and a hybrid multi-scale loss function, achieving state-of-the-art performance in experiments.

The growing availability of commodity RGB-D cameras has boosted the applications in the field of scene understanding. However, as a fundamental scene understanding task, surface normal estimation from RGB-D data lacks thorough investigation. In this paper, a hierarchical fusion network with adaptive feature re-weighting is proposed for surface normal estimation from a single RGB-D image. Specifically, the features from color image and depth are successively integrated at multiple scales to ensure global surface smoothness while preserving visually salient details. Meanwhile, the depth features are re-weighted with a confidence map estimated from depth before merging into the color branch to avoid artifacts caused by input depth corruption. Additionally, a hybrid multi-scale loss function is designed to learn accurate normal estimation given noisy ground-truth dataset. Extensive experimental results validate the effectiveness of the fusion strategy and the loss design, outperforming state-of-the-art normal estimation schemes.

View on arXiv PDF Code

Similar