CVJul 29, 2024

BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation

arXiv:2407.20437v24 citationsh-index: 3
AI Analysis

This work addresses a specific bottleneck in self-supervised depth estimation for applications like autonomous driving and robotics, representing a strong incremental improvement over existing methods.

The paper tackles the problem of self-supervised monocular depth estimation by exploiting larger baselines, which typically degrade depth quality due to issues like brightness changes and occlusion. The proposed BaseBoostDepth method achieves state-of-the-art performance on KITTI and SYNS-patches datasets across multiple metrics without increasing test-time computational complexity.

In the domain of multi-baseline stereo, the conventional understanding is that, in general, increasing baseline separation substantially enhances the accuracy of depth estimation. However, prevailing self-supervised depth estimation architectures primarily use minimal frame separation and a constrained stereo baseline. Larger frame separations can be employed; however, we show this to result in diminished depth quality due to various factors, including significant changes in brightness, and increased areas of occlusion. In response to these challenges, our proposed method, BaseBoostDepth, incorporates a curriculum learning-inspired optimization strategy to effectively leverage larger frame separations. However, we show that our curriculum learning-inspired strategy alone does not suffice, as larger baselines still cause pose estimation drifts. Therefore, we introduce incremental pose estimation to enhance the accuracy of pose estimations, resulting in significant improvements across all depth metrics. Additionally, to improve the robustness of the model, we introduce error-induced reconstructions, which optimize reconstructions with added error to the pose estimations. Ultimately, our final depth network achieves state-of-the-art performance on KITTI and SYNS-patches datasets across image-based, edge-based, and point cloud-based metrics without increasing computational complexity at test time. The project website can be found at https://kieran514.github.io/BaseBoostDepth-Project.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes