LGCVMLApr 7, 2024

On the Learnability of Out-of-distribution Detection

arXiv:2404.04865v111 citationsh-index: 10J mach learn res
Originality Incremental advance
AI Analysis

This work addresses the theoretical foundation of OOD detection, a critical problem for robust machine learning systems, though it is incremental in extending existing learning theory.

The paper investigates the learnability of out-of-distribution (OOD) detection using PAC learning theory, proving impossibility theorems under certain scenarios and providing necessary and sufficient conditions for learnability in practical settings.

Supervised learning aims to train a classifier under the assumption that training and test data are from the same distribution. To ease the above assumption, researchers have studied a more realistic setting: out-of-distribution (OOD) detection, where test data may come from classes that are unknown during training (i.e., OOD data). Due to the unavailability and diversity of OOD data, good generalization ability is crucial for effective OOD detection algorithms, and corresponding learning theory is still an open problem. To study the generalization of OOD detection, this paper investigates the probably approximately correct (PAC) learning theory of OOD detection that fits the commonly used evaluation metrics in the literature. First, we find a necessary condition for the learnability of OOD detection. Then, using this condition, we prove several impossibility theorems for the learnability of OOD detection under some scenarios. Although the impossibility theorems are frustrating, we find that some conditions of these impossibility theorems may not hold in some practical scenarios. Based on this observation, we next give several necessary and sufficient conditions to characterize the learnability of OOD detection in some practical scenarios. Lastly, we offer theoretical support for representative OOD detection works based on our OOD theory.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes