CVJan 5, 2019

Adaptive Fusion for RGB-D Salient Object Detection

arXiv:1901.01369v2219 citations
AI Analysis

This work addresses the problem of improving object detection accuracy in multi-modal images for computer vision applications, representing an incremental advancement.

The paper tackles RGB-D salient object detection by proposing an adaptive fusion scheme to combine saliency predictions from RGB and depth modalities, resulting in outperforming state-of-the-art methods on three public datasets.

RGB-D salient object detection aims to identify the most visually distinctive objects in a pair of color and depth images. Based upon an observation that most of the salient objects may stand out at least in one modality, this paper proposes an adaptive fusion scheme to fuse saliency predictions generated from two modalities. Specifically, we design a two-streamed convolutional neural network (CNN), each of which extracts features and predicts a saliency map from either RGB or depth modality. Then, a saliency fusion module learns a switch map that is used to adaptively fuse the predicted saliency maps. A loss function composed of saliency supervision, switch map supervision, and edge-preserving constraints is designed to make full supervision, and the entire network is trained in an end-to-end manner. Benefited from the adaptive fusion strategy and the edge-preserving constraint, our approach outperforms state-of-the-art methods on three publicly available datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes