CVMar 19, 2018

Depth-aware CNN for RGB-D Segmentation

arXiv:1803.06791v1291 citations
Originality Incremental advance
AI Analysis

This addresses the computational and memory inefficiencies in RGB-D segmentation for computer vision applications, though it is incremental as it builds on existing CNN frameworks.

The paper tackled the problem of incorporating geometric information into CNNs for RGB-D semantic segmentation by proposing depth-aware convolution and pooling operations, which achieved competitive results on benchmarks without adding parameters.

Convolutional neural networks (CNN) are limited by the lack of capability to handle geometric information due to the fixed grid kernel structure. The availability of depth data enables progress in RGB-D semantic segmentation with CNNs. State-of-the-art methods either use depth as additional images or process spatial information in 3D volumes or point clouds. These methods suffer from high computation and memory cost. To address these issues, we present Depth-aware CNN by introducing two intuitive, flexible and effective operations: depth-aware convolution and depth-aware average pooling. By leveraging depth similarity between pixels in the process of information propagation, geometry is seamlessly incorporated into CNN. Without introducing any additional parameters, both operators can be easily integrated into existing CNNs. Extensive experiments and ablation studies on challenging RGB-D semantic segmentation benchmarks validate the effectiveness and flexibility of our approach.

Code Implementations4 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes