CVLGNEMay 7, 2015

Object detection via a multi-region & semantic segmentation-aware CNN model

arXiv:1505.01749v3771 citations
Originality Incremental advance
AI Analysis

This improves object detection accuracy for computer vision applications, representing a strong incremental advance.

The paper tackles object detection by proposing a multi-region CNN with semantic segmentation-aware features, achieving mAP of 78.2% on PASCAL VOC2007 and 73.9% on PASCAL VOC2012, surpassing prior work.

We propose an object detection system that relies on a multi-region deep convolutional neural network (CNN) that also encodes semantic segmentation-aware features. The resulting CNN-based representation aims at capturing a diverse set of discriminative appearance factors and exhibits localization sensitivity that is essential for accurate object localization. We exploit the above properties of our recognition module by integrating it on an iterative localization mechanism that alternates between scoring a box proposal and refining its location with a deep CNN regression model. Thanks to the efficient use of our modules, we detect objects with very high localization accuracy. On the detection challenges of PASCAL VOC2007 and PASCAL VOC2012 we achieve mAP of 78.2% and 73.9% correspondingly, surpassing any other published work by a significant margin.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes