CVOct 5, 2022

Granularity-aware Adaptation for Image Retrieval over Multiple Tasks

arXiv:2210.02254v19 citationsh-index: 35
Originality Incremental advance
AI Analysis

This addresses the need for versatile image search models that can operate across diverse domains without labeled data, though it is incremental as it builds on existing self-supervised learning methods.

The paper tackles the problem of adapting a pretrained image retrieval model to handle multiple heterogeneous tasks using only unlabeled images, achieving improvements in zero-shot performance and matching or exceeding a task-aware oracle in some cases.

Strong image search models can be learned for a specific domain, ie. set of labels, provided that some labeled images of that domain are available. A practical visual search model, however, should be versatile enough to solve multiple retrieval tasks simultaneously, even if those cover very different specialized domains. Additionally, it should be able to benefit from even unlabeled images from these various retrieval tasks. This is the more practical scenario that we consider in this paper. We address it with the proposed Grappa, an approach that starts from a strong pretrained model, and adapts it to tackle multiple retrieval tasks concurrently, using only unlabeled images from the different task domains. We extend the pretrained model with multiple independently trained sets of adaptors that use pseudo-label sets of different sizes, effectively mimicking different pseudo-granularities. We reconcile all adaptor sets into a single unified model suited for all retrieval tasks by learning fusion layers that we guide by propagating pseudo-granularity attentions across neighbors in the feature space. Results on a benchmark composed of six heterogeneous retrieval tasks show that the unsupervised Grappa model improves the zero-shot performance of a state-of-the-art self-supervised learning model, and in some places reaches or improves over a task label-aware oracle that selects the most fitting pseudo-granularity per task.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes