CVCYLGJun 9, 2023

How Does Fine-Tuning Impact Out-of-Distribution Detection for Vision-Language Models?

arXiv:2306.06048v357 citationsh-index: 50
Originality Incremental advance
AI Analysis

This addresses the reliability of fine-tuned models for OOD detection without OOD labels, which is important for deploying vision-language models in real-world applications, though it is incremental as it builds on existing CLIP-based methods.

The paper investigates how fine-tuning affects out-of-distribution (OOD) detection in vision-language models like CLIP for few-shot tasks, finding that proper OOD score selection, particularly the maximum concept matching (MCM) score, is crucial and that prompt learning achieves state-of-the-art OOD detection performance compared to zero-shot methods.

Recent large vision-language models such as CLIP have shown remarkable out-of-distribution (OOD) detection and generalization performance. However, their zero-shot in-distribution (ID) accuracy is often limited for downstream datasets. Recent CLIP-based fine-tuning methods such as prompt learning have demonstrated significant improvements in ID classification and OOD generalization where OOD labels are available. Nonetheless, it remains unclear whether the model is reliable to semantic shifts without OOD labels. In this paper, we aim to bridge the gap and present a comprehensive study to understand how fine-tuning impact OOD detection for few-shot downstream tasks. By framing OOD detection as multi-modal concept matching, we establish a connection between fine-tuning methods and various OOD scores. Our results suggest that a proper choice of OOD scores is essential for CLIP-based fine-tuning. In particular, the maximum concept matching (MCM) score provides a promising solution consistently. We also show that prompt learning demonstrates the state-of-the-art OOD detection performance over the zero-shot counterpart.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes