CVLGMar 4, 2025

Joint Out-of-Distribution Filtering and Data Discovery Active Learning

arXiv:2503.02491v110 citationsh-index: 12CVPR
Originality Highly original
AI Analysis

This work addresses the challenge of efficient data selection for deep learning when faced with unknown categories and out-of-distribution samples, which is incremental as it combines existing ideas into a novel method.

The paper tackles the problem of active learning in real-world scenarios with incomplete data knowledge by jointly filtering out-of-distribution data and discovering new categories, resulting in Joda achieving the highest accuracy and best balance in experiments across 18 configurations and 3 metrics.

As the data demand for deep learning models increases, active learning (AL) becomes essential to strategically select samples for labeling, which maximizes data efficiency and reduces training costs. Real-world scenarios necessitate the consideration of incomplete data knowledge within AL. Prior works address handling out-of-distribution (OOD) data, while another research direction has focused on category discovery. However, a combined analysis of real-world considerations combining AL with out-of-distribution data and category discovery remains unexplored. To address this gap, we propose Joint Out-of-distribution filtering and data Discovery Active learning (Joda) , to uniquely address both challenges simultaneously by filtering out OOD data before selecting candidates for labeling. In contrast to previous methods, we deeply entangle the training procedure with filter and selection to construct a common feature space that aligns known and novel categories while separating OOD samples. Unlike previous works, Joda is highly efficient and completely omits auxiliary models and training access to the unlabeled pool for filtering or selection. In extensive experiments on 18 configurations and 3 metrics, \ours{} consistently achieves the highest accuracy with the best class discovery to OOD filtering balance compared to state-of-the-art competitor approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes