CVMay 19, 2020

Associating Multi-Scale Receptive Fields for Fine-grained Recognition

arXiv:2005.09153v1Has Code
AI Analysis

This addresses the challenge of fine-grained recognition for computer vision applications, but it is incremental as it builds on existing non-local modules.

The paper tackles the problem of modeling interactions between multi-scale part features in fine-grained image recognition by proposing a cross-layer non-local (CNL) module, achieving or surpassing state-of-the-art results on three benchmark datasets.

Extracting and fusing part features have become the key of fined-grained image recognition. Recently, Non-local (NL) module has shown excellent improvement in image recognition. However, it lacks the mechanism to model the interactions between multi-scale part features, which is vital for fine-grained recognition. In this paper, we propose a novel cross-layer non-local (CNL) module to associate multi-scale receptive fields by two operations. First, CNL computes correlations between features of a query layer and all response layers. Second, all response features are weighted according to the correlations and are added to the query features. Due to the interactions of cross-layer features, our model builds spatial dependencies among multi-level layers and learns more discriminative features. In addition, we can reduce the aggregation cost if we set low-dimensional deep layer as query layer. Experiments are conducted to show our model achieves or surpasses state-of-the-art results on three benchmark datasets of fine-grained classification. Our codes can be found at github.com/FouriYe/CNL-ICIP2020.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes