CVMay 23, 2025

A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency

arXiv:2505.18024v13 citationsh-index: 3Has Code
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in stereo matching for computer vision applications, offering an incremental improvement over prior iterative methods.

The paper tackles the problem of frequency convergence inconsistency in stereo matching, where existing methods degrade high-frequency details like edges. The proposed wavelet-based framework separates high and low frequencies, achieving state-of-the-art results by ranking 1st on KITTI 2015 and 2012 leaderboards for almost all metrics.

We find that the EPE evaluation metrics of RAFT-stereo converge inconsistently in the low and high frequency regions, resulting high frequency degradation (e.g., edges and thin objects) during the iterative process. The underlying reason for the limited performance of current iterative methods is that it optimizes all frequency components together without distinguishing between high and low frequencies. We propose a wavelet-based stereo matching framework (Wavelet-Stereo) for solving frequency convergence inconsistency. Specifically, we first explicitly decompose an image into high and low frequency components using discrete wavelet transform. Then, the high-frequency and low-frequency components are fed into two different multi-scale frequency feature extractors. Finally, we propose a novel LSTM-based high-frequency preservation update operator containing an iterative frequency adapter to provide adaptive refined high-frequency features at different iteration steps by fine-tuning the initial high-frequency features. By processing high and low frequency components separately, our framework can simultaneously refine high-frequency information in edges and low-frequency information in smooth regions, which is especially suitable for challenging scenes with fine details and textures in the distance. Extensive experiments demonstrate that our Wavelet-Stereo outperforms the state-of-the-art methods and ranks 1st on both the KITTI 2015 and KITTI 2012 leaderboards for almost all metrics. We will provide code and pre-trained models to encourage further exploration, application, and development of our innovative framework (https://github.com/SIA-IDE/Wavelet-Stereo).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes