CVLGNENov 24, 2014

Scale-Invariant Convolutional Neural Networks

arXiv:1411.6369v1150 citations
Originality Incremental advance
AI Analysis

This addresses scale variation issues in computer vision for applications like object recognition, though it is an incremental improvement over existing multi-column methods.

The paper tackles the limited scale tolerance of CNNs by proposing SiCNN, a multi-column architecture with shared filter parameters across scales, which achieves strong robustness to object scale variations without increasing model size.

Even though convolutional neural networks (CNN) has achieved near-human performance in various computer vision tasks, its ability to tolerate scale variations is limited. The popular practise is making the model bigger first, and then train it with data augmentation using extensive scale-jittering. In this paper, we propose a scaleinvariant convolutional neural network (SiCNN), a modeldesigned to incorporate multi-scale feature exaction and classification into the network structure. SiCNN uses a multi-column architecture, with each column focusing on a particular scale. Unlike previous multi-column strategies, these columns share the same set of filter parameters by a scale transformation among them. This design deals with scale variation without blowing up the model size. Experimental results show that SiCNN detects features at various scales, and the classification result exhibits strong robustness against object scale variations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes