CVLGMLJul 24, 2020

Hard negative examples are hard, but useful

arXiv:2007.12749v2157 citations
AI Analysis

This work addresses a key bottleneck in distance metric learning for image retrieval, particularly benefiting domains with high intra-class variance, and is incremental as it builds on existing triplet loss methods.

The paper tackled the problem of triplet loss training failing with hard negative examples, which are crucial for capturing semantic similarity, and proposed a simple fix to the loss function that enables feasible optimization with these examples, leading to more generalizable features and state-of-the-art image retrieval results on datasets with high intra-class variance.

Triplet loss is an extremely common approach to distance metric learning. Representations of images from the same class are optimized to be mapped closer together in an embedding space than representations of images from different classes. Much work on triplet losses focuses on selecting the most useful triplets of images to consider, with strategies that select dissimilar examples from the same class or similar examples from different classes. The consensus of previous research is that optimizing with the \textit{hardest} negative examples leads to bad training behavior. That's a problem -- these hardest negatives are literally the cases where the distance metric fails to capture semantic similarity. In this paper, we characterize the space of triplets and derive why hard negatives make triplet loss training fail. We offer a simple fix to the loss function and show that, with this fix, optimizing with hard negative examples becomes feasible. This leads to more generalizable features, and image retrieval results that outperform state of the art for datasets with high intra-class variance.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes