CVJul 26, 2020

Learning and aggregating deep local descriptors for instance-level recognition

arXiv:2007.13172v1114 citations
AI Analysis

This work addresses the challenge of scalable image search for applications like visual recognition, offering an incremental improvement by enhancing local descriptors over global ones.

The authors tackled the problem of instance-level recognition by proposing an efficient method to learn deep local descriptors, achieving state-of-the-art performance with lower memory requirements compared to existing methods, even with a small backbone network like ResNet18.

We propose an efficient method to learn deep local descriptors for instance-level recognition. The training only requires examples of positive and negative image pairs and is performed as metric learning of sum-pooled global image descriptors. At inference, the local descriptors are provided by the activations of internal components of the network. We demonstrate why such an approach learns local descriptors that work well for image similarity estimation with classical efficient match kernel methods. The experimental validation studies the trade-off between performance and memory requirements of the state-of-the-art image search approach based on match kernels. Compared to existing local descriptors, the proposed ones perform better in two instance-level recognition tasks and keep memory requirements lower. We experimentally show that global descriptors are not effective enough at large scale and that local descriptors are essential. We achieve state-of-the-art performance, in some cases even with a backbone network as small as ResNet18.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes