CVJul 12, 2018

Video Saliency Detection by 3D Convolutional Neural Networks

arXiv:1807.04514v110 citations
Originality Incremental advance
AI Analysis

This addresses the problem of identifying salient objects in videos for computer vision applications, representing an incremental improvement.

The paper tackles video saliency detection by proposing a 3D convolutional neural network approach to extract and combine spatial and temporal features, achieving better performance than state-of-the-art methods.

Different from salient object detection methods for still images, a key challenging for video saliency detection is how to extract and combine spatial and temporal features. In this paper, we present a novel and effective approach for salient object detection for video sequences based on 3D convolutional neural networks. First, we design a 3D convolutional network (Conv3DNet) with the input as three video frame to learn the spatiotemporal features for video sequences. Then, we design a 3D deconvolutional network (Deconv3DNet) to combine the spatiotemporal features to predict the final saliency map for video sequences. Experimental results show that the proposed saliency detection model performs better in video saliency prediction compared with the state-of-the-art video saliency detection methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes