CVJun 18, 2019

Learning with Average Precision: Training Image Retrieval with a Listwise Loss

arXiv:1906.07589v1426 citations
Originality Highly original
AI Analysis

This addresses a key bottleneck in image retrieval for computer vision applications, offering a more efficient and effective training method without the need for engineering tricks.

The paper tackles the problem of suboptimal mean average precision (mAP) in image retrieval by directly optimizing global mAP using a listwise loss, achieving new state-of-the-art results on standard benchmarks.

Image retrieval can be formulated as a ranking problem where the goal is to order database images by decreasing similarity to the query. Recent deep models for image retrieval have outperformed traditional methods by leveraging ranking-tailored loss functions, but important theoretical and practical problems remain. First, rather than directly optimizing the global ranking, they minimize an upper-bound on the essential loss, which does not necessarily result in an optimal mean average precision (mAP). Second, these methods require significant engineering efforts to work well, e.g. special pre-training and hard-negative mining. In this paper we propose instead to directly optimize the global mAP by leveraging recent advances in listwise loss formulations. Using a histogram binning approximation, the AP can be differentiated and thus employed to end-to-end learning. Compared to existing losses, the proposed method considers thousands of images simultaneously at each iteration and eliminates the need for ad hoc tricks. It also establishes a new state of the art on many standard retrieval benchmarks. Models and evaluation scripts have been made available at https://europe.naverlabs.com/Deep-Image-Retrieval/

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes