CVApr 15, 2024

Digging into contrastive learning for robust depth estimation with diffusion models

arXiv:2404.09831v423 citationsh-index: 18Has CodeMM
Originality Incremental advance
AI Analysis

This work addresses robust depth estimation for autonomous systems in complex environments, representing an incremental improvement with a novel hybrid method.

The paper tackles the problem of unreliable depth estimation under adverse weather conditions using diffusion models, proposing D4RD with a custom contrastive learning mode that integrates knowledge distillation to improve robustness, achieving state-of-the-art performance on synthetic and real-world datasets.

Recently, diffusion-based depth estimation methods have drawn widespread attention due to their elegant denoising patterns and promising performance. However, they are typically unreliable under adverse conditions prevalent in real-world scenarios, such as rainy, snowy, etc. In this paper, we propose a novel robust depth estimation method called D4RD, featuring a custom contrastive learning mode tailored for diffusion models to mitigate performance degradation in complex environments. Concretely, we integrate the strength of knowledge distillation into contrastive learning, building the `trinity' contrastive scheme. This scheme utilizes the sampled noise of the forward diffusion process as a natural reference, guiding the predicted noise in diverse scenes toward a more stable and precise optimum. Moreover, we extend noise-level trinity to encompass more generic feature and image levels, establishing a multi-level contrast to distribute the burden of robust perception across the overall network. Before addressing complex scenarios, we enhance the stability of the baseline diffusion model with three straightforward yet effective improvements, which facilitate convergence and remove depth outliers. Extensive experiments demonstrate that D4RD surpasses existing state-of-the-art solutions on synthetic corruption datasets and real-world weather conditions. Source code and data are available at \url{https://github.com/wangjiyuan9/D4RD}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes