LGNov 17, 2020

Improving Calibration in Deep Metric Learning With Cross-Example Softmax

arXiv:2011.08824v13 citations
AI Analysis

This work addresses the problem of improving similarity metric calibration for image retrieval systems, which could benefit applications relying on interpretable distance measures.

This paper proposes Cross-Example Softmax, a novel loss function that combines top-k and threshold relevancy properties in deep metric learning. It encourages all queries to be closer to their matching images than to all non-matching images, leading to a more calibrated similarity metric. The method improves global calibration and retrieval performance on Conceptual Captions and Flickr30k.

Modern image retrieval systems increasingly rely on the use of deep neural networks to learn embedding spaces in which distance encodes the relevance between a given query and image. In this setting, existing approaches tend to emphasize one of two properties. Triplet-based methods capture top-$k$ relevancy, where all top-$k$ scoring documents are assumed to be relevant to a given query Pairwise contrastive models capture threshold relevancy, where all documents scoring higher than some threshold are assumed to be relevant. In this paper, we propose Cross-Example Softmax which combines the properties of top-$k$ and threshold relevancy. In each iteration, the proposed loss encourages all queries to be closer to their matching images than all queries are to all non-matching images. This leads to a globally more calibrated similarity metric and makes distance more interpretable as an absolute measure of relevance. We further introduce Cross-Example Negative Mining, in which each pair is compared to the hardest negative comparisons across the entire batch. Empirically, we show in a series of experiments on Conceptual Captions and Flickr30k, that the proposed method effectively improves global calibration and also retrieval performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes