CVApr 14, 2021

Discrete Cosine Transform Network for Guided Depth Map Super-Resolution

arXiv:2104.06977v3139 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses depth map enhancement for applications like robotics and AR/VR, but it is incremental as it builds on existing guided super-resolution methods.

The paper tackles guided depth super-resolution by proposing a Discrete Cosine Transform Network (DCTNet) to improve high-resolution depth map reconstruction using RGB guidance, achieving state-of-the-art performance with fewer parameters.

Guided depth super-resolution (GDSR) is an essential topic in multi-modal image processing, which reconstructs high-resolution (HR) depth maps from low-resolution ones collected with suboptimal conditions with the help of HR RGB images of the same scene. To solve the challenges in interpreting the working mechanism, extracting cross-modal features and RGB texture over-transferred, we propose a novel Discrete Cosine Transform Network (DCTNet) to alleviate the problems from three aspects. First, the Discrete Cosine Transform (DCT) module reconstructs the multi-channel HR depth features by using DCT to solve the channel-wise optimization problem derived from the image domain. Second, we introduce a semi-coupled feature extraction module that uses shared convolutional kernels to extract common information and private kernels to extract modality-specific information. Third, we employ an edge attention mechanism to highlight the contours informative for guided upsampling. Extensive quantitative and qualitative evaluations demonstrate the effectiveness of our DCTNet, which outperforms previous state-of-the-art methods with a relatively small number of parameters. The code is available at \url{https://github.com/Zhaozixiang1228/GDSR-DCTNet}.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes