CVAIIRApr 7, 2024

Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval

arXiv:2404.04998v10.1613 citationsh-index: 30Has CodeAAAI
AI Analysis50

This addresses the label-hungry issue in image retrieval for applications relying on user-uploaded images with informal tags, though it is incremental as it adapts existing methods to a new supervision type.

The paper tackles the problem of learning deep quantization for image retrieval without ground-truth labels by using weakly tagged images, achieving state-of-the-art performance in weakly-supervised compact coding.

Deep quantization methods have shown high efficiency on large-scale image retrieval. However, current models heavily rely on ground-truth information, hindering the application of quantization in label-hungry scenarios. A more realistic demand is to learn from inexhaustible uploaded images that are associated with informal tags provided by amateur users. Though such sketchy tags do not obviously reveal the labels, they actually contain useful semantic information for supervising deep quantization. To this end, we propose Weakly-Supervised Deep Hyperspherical Quantization (WSDHQ), which is the first work to learn deep quantization from weakly tagged images. Specifically, 1) we use word embeddings to represent the tags and enhance their semantic information based on a tag correlation graph. 2) To better preserve semantic information in quantization codes and reduce quantization error, we jointly learn semantics-preserving embeddings and supervised quantizer on hypersphere by employing a well-designed fusion layer and tailor-made loss functions. Extensive experiments show that WSDHQ can achieve state-of-art performance on weakly-supervised compact coding. Code is available at https://github.com/gimpong/AAAI21-WSDHQ.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes