CVJul 23, 2020

Accurate RGB-D Salient Object Detection via Collaborative Learning

arXiv:2007.11782v1178 citations
Originality Incremental advance
AI Analysis

This work addresses efficiency and accuracy issues in RGB-D saliency detection, making it more lightweight and versatile for practical applications.

The paper tackles the problem of blurry object boundaries and high computational cost in RGB-D salient object detection by proposing a collaborative learning framework that integrates edge, depth, and saliency information, achieving superior performance on seven benchmark datasets.

Benefiting from the spatial cues embedded in depth images, recent progress on RGB-D saliency detection shows impressive ability on some challenge scenarios. However, there are still two limitations. One hand is that the pooling and upsampling operations in FCNs might cause blur object boundaries. On the other hand, using an additional depth-network to extract depth features might lead to high computation and storage cost. The reliance on depth inputs during testing also limits the practical applications of current RGB-D models. In this paper, we propose a novel collaborative learning framework where edge, depth and saliency are leveraged in a more efficient way, which solves those problems tactfully. The explicitly extracted edge information goes together with saliency to give more emphasis to the salient regions and object boundaries. Depth and saliency learning is innovatively integrated into the high-level feature learning process in a mutual-benefit manner. This strategy enables the network to be free of using extra depth networks and depth inputs to make inference. To this end, it makes our model more lightweight, faster and more versatile. Experiment results on seven benchmark datasets show its superior performance.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes