CVApr 21, 2019

A Simple Pooling-Based Design for Real-Time Salient Object Detection

arXiv:1904.09569v1979 citations
Originality Incremental advance
AI Analysis

This work addresses real-time detection of salient objects in images, which is incremental as it builds on the U-shape architecture with novel pooling modules.

The paper tackles salient object detection by enhancing pooling in CNNs to improve location accuracy and detail sharpness, achieving over 30 FPS on 300x400 images and outperforming previous state-of-the-art methods.

We solve the problem of salient object detection by investigating how to expand the role of pooling in convolutional neural networks. Based on the U-shape architecture, we first build a global guidance module (GGM) upon the bottom-up pathway, aiming at providing layers at different feature levels the location information of potential salient objects. We further design a feature aggregation module (FAM) to make the coarse-level semantic information well fused with the fine-level features from the top-down pathway. By adding FAMs after the fusion operations in the top-down pathway, coarse-level features from the GGM can be seamlessly merged with features at various scales. These two pooling-based modules allow the high-level semantic features to be progressively refined, yielding detail enriched saliency maps. Experiment results show that our proposed approach can more accurately locate the salient objects with sharpened details and hence substantially improve the performance compared to the previous state-of-the-arts. Our approach is fast as well and can run at a speed of more than 30 FPS when processing a $300 \times 400$ image. Code can be found at http://mmcheng.net/poolnet/.

Code Implementations5 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes