CVJan 30, 2015

SHOE: Supervised Hashing with Output Embeddings

arXiv:1502.00030v1
Originality Incremental advance
AI Analysis

This work addresses image retrieval efficiency and accuracy for applications like fine-grained classification, though it appears incremental as it builds on existing supervised hashing schemes.

The paper tackles the problem of supervised binary encoding for image retrieval by learning projections that incorporate similarity between classes from output embeddings, resulting in improved retrieval accuracy over state-of-the-art methods on datasets like CUB-2011, SUN-Attribute, and ImageNet ILSVRC 2010, with an impressive compression ratio of 1024 on CNN features.

We present a supervised binary encoding scheme for image retrieval that learns projections by taking into account similarity between classes obtained from output embeddings. Our motivation is that binary hash codes learned in this way improve both the visual quality of retrieval results and existing supervised hashing schemes. We employ a sequential greedy optimization that learns relationship aware projections by minimizing the difference between inner products of binary codes and output embedding vectors. We develop a joint optimization framework to learn projections which improve the accuracy of supervised hashing over the current state of the art with respect to standard and sibling evaluation metrics. We further boost performance by applying the supervised dimensionality reduction technique on kernelized input CNN features. Experiments are performed on three datasets: CUB-2011, SUN-Attribute and ImageNet ILSVRC 2010. As a by-product of our method, we show that using a simple k-nn pooling classifier with our discriminative codes improves over the complex classification models on fine grained datasets like CUB and offer an impressive compression ratio of 1024 on CNN features.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes