CVLGDec 12, 2024

BayesAdapter: enhanced uncertainty estimation in CLIP few-shot adaptation

arXiv:2412.09718v26 citationsh-index: 40Has Code
AI Analysis

This work addresses the need for reliable uncertainty estimation in CLIP adapters for safe deployment in real-world scenarios, representing an incremental improvement over existing methods.

The paper tackles the problem of poor uncertainty estimation in CLIP adapters for few-shot adaptation, showing that BayesAdapter, a Bayesian inference-based method, achieves high-quality uncertainty estimates with improved calibration and selective classification.

The emergence of large pre-trained vision-language models (VLMs) represents a paradigm shift in machine learning, with unprecedented results in a broad span of visual recognition tasks. CLIP, one of the most popular VLMs, has exhibited remarkable zero-shot and transfer learning capabilities in classification. To transfer CLIP to downstream tasks, adapters constitute a parameter-efficient approach that avoids backpropagation through the large model (unlike related prompt learning methods). However, CLIP adapters have been developed to target discriminative performance, and the quality of their uncertainty estimates has been overlooked. In this work we show that the discriminative performance of state-of-the-art CLIP adapters does not always correlate with their uncertainty estimation capabilities, which are essential for a safe deployment in real-world scenarios. We also demonstrate that one of such adapters is obtained through MAP inference from a more general probabilistic framework. Based on this observation we introduce BayesAdapter, which leverages Bayesian inference to estimate a full probability distribution instead of a single point, better capturing the variability inherent in the parameter space. In a comprehensive empirical evaluation we show that our approach obtains high quality uncertainty estimates in the predictions, standing out in calibration and selective classification. Our code will be publicly available upon acceptance of the paper.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes