CV · Dec 15, 2023

Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval

arXiv:2312.09716v1 · 15 citations · h-index: 11 · Has Code · AAAI
Originality: Incremental advance
AI Analysis

This work addresses efficiency challenges in visual retrieval for applications such as image and video search. The contribution is incremental, since it builds on existing distillation techniques.

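The paper tackles the trade-off between accuracy and efficiency in visual retrieval by proposing Whiten-MTD, a multi-teacher distillation framework that transfers knowledge from off-the-shelf pre-trained retrieval models to a lightweight student model. The teachers' outputs are whitened before fusion so that their similarities become comparable. The authors report a good balance of retrieval performance and efficiency on two landmark image retrieval datasets and one video retrieval dataset.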

Visual retrieval aims to search for the most relevant visual items, e.g., images and videos, from a candidate gallery with a given query item. Accuracy and efficiency are two competing objectives in retrieval tasks. Instead of crafting a new method pursuing further improvement on accuracy, in this paper we propose a multi-teacher distillation framework Whiten-MTD, which is able to transfer knowledge from off-the-shelf pre-trained retrieval models to a lightweight student model for efficient visual retrieval. Furthermore, we discover that the similarities obtained by different retrieval models are diversified and incommensurable, which makes it challenging to jointly distill knowledge from multiple models. Therefore, we propose to whiten the output of teacher models before fusion, which enables effective multi-teacher distillation for retrieval models. Whiten-MTD is conceptually simple and practically effective. Extensive experiments on two landmark image retrieval datasets and one video retrieval dataset demonstrate the effectiveness of our proposed method, and its good balance of retrieval performance and efficiency. Our source code is released at https://github.com/Maryeon/whiten_mtd.
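To illustrate why incommensurable similarities make naive fusion difficult, here is a minimal sketch in plain Python. It is not the authors' implementation. It assumes a simplified scalar stand-in for whitening (shifting each teacher's similarity scores to zero mean and unit variance) and simple averaging as the fusion step. The function names `standardize` and `fuse_teachers` and the score values are hypothetical; the released repository should be consulted for the actual whitening and distillation procedure.

```python
from statistics import mean, pstdev


def standardize(scores):
    """Shift scores to zero mean and scale to unit variance.

    Assumption: a scalar simplification of whitening, used only for illustration.
    """
    mu = mean(scores)
    sigma = pstdev(scores)
    if sigma == 0:
        # All scores identical: no ranking information to preserve.
        return [0.0 for _ in scores]
    return [(s - mu) / sigma for s in scores]


def fuse_teachers(teacher_scores):
    """Average the standardized scores of all teachers, per gallery item."""
    whitened = [standardize(scores) for scores in teacher_scores]
    return [mean(column) for column in zip(*whitened)]


# Hypothetical query-to-gallery similarities from two teachers on different scales.
teacher_a = [0.91, 0.89, 0.90, 0.88]  # narrow range
teacher_b = [12.0, 30.0, 18.0, 6.0]   # wide, unnormalized range

# Naive averaging: the wide-range teacher dominates the fused ranking.
naive = [mean(pair) for pair in zip(teacher_a, teacher_b)]

# Standardize first, then average: both teachers contribute on a common scale.
fused = fuse_teachers([teacher_a, teacher_b])
```

In this sketch, `fused` could serve as the target similarity that a lightweight student is trained to reproduce, whereas `naive` would mostly mirror `teacher_b` because its scores span a much larger range.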
