CVJul 10, 2021

A Weakly-Supervised Depth Estimation Network Using Attention Mechanism

arXiv:2107.04819v11 citations
Originality Incremental advance
AI Analysis

This addresses the problem of needing accurate labels for depth estimation, which is incremental as it builds on existing weakly-supervised approaches.

The paper tackles monocular depth estimation with inaccurate labels by proposing ANUW, a weakly-supervised network using attention mechanisms, which achieves superior results compared to state-of-the-art methods on three public datasets.

Monocular depth estimation (MDE) is a fundamental task in many applications such as scene understanding and reconstruction. However, most of the existing methods rely on accurately labeled datasets. A weakly-supervised framework based on attention nested U-net (ANU) named as ANUW is introduced in this paper for cases with wrong labels. The ANUW is trained end-to-end to convert an input single RGB image into a depth image. It consists of a dense residual network structure, an adaptive weight channel attention (AWCA) module, a patch second non-local (PSNL) module and a soft label generation method. The dense residual network is the main body of the network to encode and decode the input. The AWCA module can adaptively adjust the channel weights to extract important features. The PSNL module implements the spatial attention mechanism through a second-order non-local method. The proposed soft label generation method uses the prior knowledge of the dataset to produce soft labels to replace false ones. The proposed ANUW is trained on a defective monocular depth dataset and the trained model is tested on three public datasets, and the results demonstrate the superiority of ANUW in comparison with the state-of-the-art MDE methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes