LGCVOct 13, 2020

Which Model to Transfer? Finding the Needle in the Growing Haystack

arXiv:2010.06402v232 citations
AI Analysis

This addresses a practical bottleneck for practitioners and researchers in computer vision who need to choose models from growing repositories, though it is incremental as it builds on existing transfer learning methods.

The paper tackles the problem of efficiently selecting a pre-trained model from large repositories for transfer learning in computer vision, showing that existing strategies can lead to high regret, and proposes a hybrid search strategy that outperforms them on 19 diverse tasks.

Transfer learning has been recently popularized as a data-efficient alternative to training models from scratch, in particular for computer vision tasks where it provides a remarkably solid baseline. The emergence of rich model repositories, such as TensorFlow Hub, enables the practitioners and researchers to unleash the potential of these models across a wide range of downstream tasks. As these repositories keep growing exponentially, efficiently selecting a good model for the task at hand becomes paramount. We provide a formalization of this problem through a familiar notion of regret and introduce the predominant strategies, namely task-agnostic (e.g. ranking models by their ImageNet performance) and task-aware search strategies (such as linear or kNN evaluation). We conduct a large-scale empirical study and show that both task-agnostic and task-aware methods can yield high regret. We then propose a simple and computationally efficient hybrid search strategy which outperforms the existing approaches. We highlight the practical benefits of the proposed solution on a set of 19 diverse vision tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes