CV | Dec 13, 2018

ELASTIC: Improving CNNs with Dynamic Scaling Policies

arXiv:1812.05262v2 | 63 citations | Has Code
Originality: Highly original
AI Analysis

ELASTIC addresses scale variation in computer vision tasks such as classification and segmentation, with an approach that can be applied to a range of network architectures.

The paper tackles scale variation in computer vision by introducing ELASTIC, a method that learns dynamic scaling policies from data. It reports improved performance on ImageNet classification, MSCOCO multi-label classification, and PASCAL VOC semantic segmentation without extra computation.

Scale variation has been a challenge from traditional to modern approaches in computer vision. Most solutions to scale issues have a similar theme: a set of intuitive and manually designed policies that are generic and fixed (e.g. SIFT or feature pyramid). We argue that the scaling policy should be learned from data. In this paper, we introduce ELASTIC, a simple, efficient and yet very effective approach to learn a dynamic scale policy from data. We formulate the scaling policy as a non-linear function inside the network's structure that (a) is learned from data, (b) is instance specific, (c) does not add extra computation, and (d) can be applied on any network architecture. We applied ELASTIC to several state-of-the-art network architectures and showed consistent improvement without extra (sometimes even lower) computation on ImageNet classification, MSCOCO multi-label classification, and PASCAL VOC semantic segmentation. Our results show major improvement for images with scale challenges. Our code is available here: https://github.com/allenai/elastic
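The abstract does not spell out the mechanism, so the following is only a minimal sketch of one way a learned, instance-specific scaling policy could sit inside a network layer. It assumes two parallel branches, one at full resolution and one at half resolution, whose outputs are summed before a non-linearity. The 1-D setting, the function names (`elastic_block`, `conv1d_same`, `downsample2`, `upsample2`) and the kernels are hypothetical and chosen for illustration; they are not taken from the authors' repository.

```python
# Illustrative sketch only: a 1-D two-branch block in pure Python.
# Assumption: a full-resolution branch and a half-resolution branch are
# summed and passed through ReLU. All names here are hypothetical.

def conv1d_same(x, kernel):
    """Cross-correlate x with an odd-length kernel, zero-padded to keep len(x)."""
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(x) + [0.0] * pad
    return [sum(k * padded[i + j] for j, k in enumerate(kernel))
            for i in range(len(x))]

def downsample2(x):
    """Average-pool by a factor of 2 (len(x) must be even)."""
    return [(x[i] + x[i + 1]) / 2.0 for i in range(0, len(x), 2)]

def upsample2(x):
    """Nearest-neighbour upsampling by a factor of 2."""
    return [v for v in x for _ in range(2)]

def relu(x):
    return [max(0.0, v) for v in x]

def elastic_block(x, k_high, k_low):
    """Sum a full-resolution branch and a half-resolution branch, then apply ReLU."""
    high = conv1d_same(x, k_high)                       # fine-scale features
    low = upsample2(conv1d_same(downsample2(x), k_low)) # coarse-scale features
    return relu([h + l for h, l in zip(high, low)])

signal = [1.0, 1.0, 3.0, 3.0]
identity = [0.0, 1.0, 0.0]
out = elastic_block(signal, identity, identity)
```

In this sketch the kernels play the role of learned weights, and the sum followed by ReLU lets the contribution of each resolution vary with the input. The half-resolution branch convolves over half as many positions, which hints at how such a block might avoid extra computation, though the exact budget in the paper may be arranged differently.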
