CVAug 15, 2018

SAN: Learning Relationship between Convolutional Features for Multi-Scale Object Detection

arXiv:1808.04974v162 citations
Originality Incremental advance
AI Analysis

This addresses scale robustness in object detection for computer vision applications, representing an incremental improvement.

The paper tackles the problem of scale variation in CNN-based object detection by proposing a Scale Aware Network (SAN) that maps features onto a scale-invariant subspace, resulting in enhanced detection accuracy on VOC PASCAL and MS COCO datasets with a slight increase in computing time.

Most of the recent successful methods in accurate object detection build on the convolutional neural networks (CNN). However, due to the lack of scale normalization in CNN-based detection methods, the activated channels in the feature space can be completely different according to a scale and this difference makes it hard for the classifier to learn samples. We propose a Scale Aware Network (SAN) that maps the convolutional features from the different scales onto a scale-invariant subspace to make CNN-based detection methods more robust to the scale variation, and also construct a unique learning method which considers purely the relationship between channels without the spatial information for the efficient learning of SAN. To show the validity of our method, we visualize how convolutional features change according to the scale through a channel activation matrix and experimentally show that SAN reduces the feature differences in the scale space. We evaluate our method on VOC PASCAL and MS COCO dataset. We demonstrate SAN by conducting several experiments on structures and parameters. The proposed SAN can be generally applied to many CNN-based detection methods to enhance the detection accuracy with a slight increase in the computing time.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes