CVJun 18, 2019

Learning with Average Precision: Training Image Retrieval with a Listwise Loss

Jerome Revaud, Jon Almazan, Rafael Sampaio de Rezende, Cesar Roberto de Souza

arXiv:1906.07589v130.8430 citations

Originality Highly original

AI Analysis

This addresses a key bottleneck in image retrieval for computer vision applications, offering a more efficient and effective training method without the need for engineering tricks.

The paper tackles the problem of suboptimal mean average precision (mAP) in image retrieval by directly optimizing global mAP using a listwise loss, achieving new state-of-the-art results on standard benchmarks.

Image retrieval can be formulated as a ranking problem where the goal is to order database images by decreasing similarity to the query. Recent deep models for image retrieval have outperformed traditional methods by leveraging ranking-tailored loss functions, but important theoretical and practical problems remain. First, rather than directly optimizing the global ranking, they minimize an upper-bound on the essential loss, which does not necessarily result in an optimal mean average precision (mAP). Second, these methods require significant engineering efforts to work well, e.g. special pre-training and hard-negative mining. In this paper we propose instead to directly optimize the global mAP by leveraging recent advances in listwise loss formulations. Using a histogram binning approximation, the AP can be differentiated and thus employed to end-to-end learning. Compared to existing losses, the proposed method considers thousands of images simultaneously at each iteration and eliminates the need for ad hoc tricks. It also establishes a new state of the art on many standard retrieval benchmarks. Models and evaluation scripts have been made available at https://europe.naverlabs.com/Deep-Image-Retrieval/

View on arXiv PDF

Similar