CVDec 3, 2022

Multi-resolution Monocular Depth Map Fusion by Self-supervised Gradient-based Composition

arXiv:2212.01538v19 citationsh-index: 18
Originality Incremental advance
AI Analysis

This work addresses detail enhancement in depth maps for computer vision applications, representing an incremental improvement over existing fusion techniques.

The paper tackles the problem of monocular depth estimation lacking fine-grained details by proposing a self-supervised gradient-based fusion module that combines multi-resolution depth maps, resulting in state-of-the-art detail enhancement and an 80X speed improvement over existing methods.

Monocular depth estimation is a challenging problem on which deep neural networks have demonstrated great potential. However, depth maps predicted by existing deep models usually lack fine-grained details due to the convolution operations and the down-samplings in networks. We find that increasing input resolution is helpful to preserve more local details while the estimation at low resolution is more accurate globally. Therefore, we propose a novel depth map fusion module to combine the advantages of estimations with multi-resolution inputs. Instead of merging the low- and high-resolution estimations equally, we adopt the core idea of Poisson fusion, trying to implant the gradient domain of high-resolution depth into the low-resolution depth. While classic Poisson fusion requires a fusion mask as supervision, we propose a self-supervised framework based on guided image filtering. We demonstrate that this gradient-based composition performs much better at noisy immunity, compared with the state-of-the-art depth map fusion method. Our lightweight depth fusion is one-shot and runs in real-time, making our method 80X faster than a state-of-the-art depth fusion method. Quantitative evaluations demonstrate that the proposed method can be integrated into many fully convolutional monocular depth estimation backbones with a significant performance boost, leading to state-of-the-art results of detail enhancement on depth maps.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes