CVJan 9, 2024

RomniStereo: Recurrent Omnidirectional Stereo Matching

arXiv:2401.04345v29 citationsh-index: 9Has CodeIEEE Robot Autom Lett
AI Analysis

This work addresses depth sensing in 360-degree images, offering a more efficient and accurate solution for applications like VR and robotics, though it is incremental as it builds on existing recurrent techniques.

The paper tackles omnidirectional stereo matching by proposing RomniStereo, which adapts recurrent methods from 2D matching to improve efficiency and accuracy, achieving a 40.7% reduction in average MAE over prior state-of-the-art across five datasets.

Omnidirectional stereo matching (OSM) is an essential and reliable means for $360^{\circ}$ depth sensing. However, following earlier works on conventional stereo matching, prior state-of-the-art (SOTA) methods rely on a 3D encoder-decoder block to regularize the cost volume, causing the whole system complicated and sub-optimal results. Recently, the Recurrent All-pairs Field Transforms (RAFT) based approach employs the recurrent update in 2D and has efficiently improved image-matching tasks, ie, optical flow, and stereo matching. To bridge the gap between OSM and RAFT, we mainly propose an opposite adaptive weighting scheme to seamlessly transform the outputs of spherical sweeping of OSM into the required inputs for the recurrent update, thus creating a recurrent omnidirectional stereo matching (RomniStereo) algorithm. Furthermore, we introduce two techniques, ie, grid embedding and adaptive context feature generation, which also contribute to RomniStereo's performance. Our best model improves the average MAE metric by 40.7\% over the previous SOTA baseline across five datasets. When visualizing the results, our models demonstrate clear advantages on both synthetic and realistic examples. The code is available at \url{https://github.com/HalleyJiang/RomniStereo}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes