CVNov 2, 2022

Decoupled Cross-Scale Cross-View Interaction for Stereo Image Enhancement in The Dark

arXiv:2211.00859v315 citationsh-index: 72
Originality Incremental advance
AI Analysis

This work solves the problem of enhancing visually unpleasant stereo images captured in dark conditions for applications like autonomous driving or 3D vision, but it is incremental as it builds on existing methods with specific improvements.

The paper tackles the problem of low-light stereo image enhancement by addressing insufficient cross-view interaction and lack of long-range dependency, resulting in a method that achieves state-of-the-art performance on multiple datasets with better illumination adjustment and detail recovery.

Low-light stereo image enhancement (LLSIE) is a relatively new task to enhance the quality of visually unpleasant stereo images captured in dark condition. However, current methods achieve inferior performance on detail recovery and illumination adjustment. We find it is because: 1) the insufficient single-scale inter-view interaction makes the cross-view cues unable to be fully exploited; 2) lacking long-range dependency leads to the inability to deal with the spatial long-range effects caused by illumination degradation. To alleviate such limitations, we propose a LLSIE model termed Decoupled Cross-scale Cross-view Interaction Network (DCI-Net). Specifically, we present a decoupled interaction module (DIM) that aims for sufficient dual-view information interaction. DIM decouples the dual-view information exchange into discovering multi-scale cross-view correlations and further exploring cross-scale information flow. Besides, we present a spatial-channel information mining block (SIMB) for intra-view feature extraction, and the benefits are twofold. One is the long-range dependency capture to build spatial long-range relationship, and the other is expanded channel information refinement that enhances information flow in channel dimension. Extensive experiments on Flickr1024, KITTI 2012, KITTI 2015 and Middlebury datasets show that our method obtains better illumination adjustment and detail recovery, and achieves SOTA performance compared to other related methods. Our codes, datasets and models will be publicly available.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes