CVAILGNov 15, 2024

COOD: Concept-based Zero-shot OOD Detection

arXiv:2411.13578v15 citationsh-index: 13
Originality Highly original
AI Analysis

This addresses a critical gap in real-world OOD detection for multi-label tasks, offering a novel solution with strong performance gains.

The paper tackles the problem of detecting out-of-distribution (OOD) samples in multi-label settings without retraining, introducing COOD, a zero-shot framework that achieves approximately 95% average AUROC on VOC and COCO datasets.

How can models effectively detect out-of-distribution (OOD) samples in complex, multi-label settings without extensive retraining? Existing OOD detection methods struggle to capture the intricate semantic relationships and label co-occurrences inherent in multi-label settings, often requiring large amounts of training data and failing to generalize to unseen label combinations. While large language models have revolutionized zero-shot OOD detection, they primarily focus on single-label scenarios, leaving a critical gap in handling real-world tasks where samples can be associated with multiple interdependent labels. To address these challenges, we introduce COOD, a novel zero-shot multi-label OOD detection framework. COOD leverages pre-trained vision-language models, enhancing them with a concept-based label expansion strategy and a new scoring function. By enriching the semantic space with both positive and negative concepts for each label, our approach models complex label dependencies, precisely differentiating OOD samples without the need for additional training. Extensive experiments demonstrate that our method significantly outperforms existing approaches, achieving approximately 95% average AUROC on both VOC and COCO datasets, while maintaining robust performance across varying numbers of labels and different types of OOD samples.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes