CVAISep 23, 2025

RoSe: Robust Self-supervised Stereo Matching under Adverse Weather Conditions

arXiv:2509.19165v14 citationsh-index: 2Has CodeIEEE transactions on circuits and systems for video technology (Print)
Originality Incremental advance
AI Analysis

This addresses a domain-specific problem for stereo vision systems in autonomous vehicles or robotics that must operate in varied weather, though it is incremental as it builds on existing self-supervised approaches.

The paper tackles the problem of self-supervised stereo matching performance degrading under adverse weather conditions like night, rain, and fog, by proposing a method that injects robust priors from a visual foundation model and uses scene correspondence priors, resulting in outperforming existing state-of-the-art methods.

Recent self-supervised stereo matching methods have made significant progress, but their performance significantly degrades under adverse weather conditions such as night, rain, and fog. We identify two primary weaknesses contributing to this performance degradation. First, adverse weather introduces noise and reduces visibility, making CNN-based feature extractors struggle with degraded regions like reflective and textureless areas. Second, these degraded regions can disrupt accurate pixel correspondences, leading to ineffective supervision based on the photometric consistency assumption. To address these challenges, we propose injecting robust priors derived from the visual foundation model into the CNN-based feature extractor to improve feature representation under adverse weather conditions. We then introduce scene correspondence priors to construct robust supervisory signals rather than relying solely on the photometric consistency assumption. Specifically, we create synthetic stereo datasets with realistic weather degradations. These datasets feature clear and adverse image pairs that maintain the same semantic context and disparity, preserving the scene correspondence property. With this knowledge, we propose a robust self-supervised training paradigm, consisting of two key steps: robust self-supervised scene correspondence learning and adverse weather distillation. Both steps aim to align underlying scene results from clean and adverse image pairs, thus improving model disparity estimation under adverse weather effects. Extensive experiments demonstrate the effectiveness and versatility of our proposed solution, which outperforms existing state-of-the-art self-supervised methods. Codes are available at \textcolor{blue}{https://github.com/cocowy1/RoSe-Robust-Self-supervised-Stereo-Matching-under-Adverse-Weather-Conditions}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes