LG CVJan 12, 2023

Forgetful Active Learning with Switch Events: Efficient Sampling for Out-of-Distribution Data

Ryan Benkert, Mohit Prabhushankar, Ghassan AlRegib

arXiv:2301.05106v111.510 citationsh-index: 14

Originality Highly original

AI Analysis

This addresses the problem of robust active learning for out-of-distribution scenarios in machine learning, offering a novel approach to improve model performance in such settings.

The paper tackles the challenge of active learning with out-of-distribution data by introducing a protocol that selects samples based on learning difficulty instead of data representations, achieving up to 4.5% accuracy improvements across multiple experiments.

This paper considers deep out-of-distribution active learning. In practice, fully trained neural networks interact randomly with out-of-distribution (OOD) inputs and map aberrant samples randomly within the model representation space. Since data representations are direct manifestations of the training distribution, the data selection process plays a crucial role in outlier robustness. For paradigms such as active learning, this is especially challenging since protocols must not only improve performance on the training distribution most effectively but further render a robust representation space. However, existing strategies directly base the data selection on the data representation of the unlabeled data which is random for OOD samples by definition. For this purpose, we introduce forgetful active learning with switch events (FALSE) - a novel active learning protocol for out-of-distribution active learning. Instead of defining sample importance on the data representation directly, we formulate "informativeness" with learning difficulty during training. Specifically, we approximate how often the network "forgets" unlabeled samples and query the most "forgotten" samples for annotation. We report up to 4.5\% accuracy improvements in over 270 experiments, including four commonly used protocols, two OOD benchmarks, one in-distribution benchmark, and three different architectures.

View on arXiv PDF

Similar