LGCVJan 7, 2023

Active Learning Guided by Efficient Surrogate Learners

arXiv:2301.02761v22 citationsh-index: 28
Originality Incremental advance
AI Analysis

This work addresses inefficiencies in active learning for deep learning practitioners, though it appears incremental as it builds on existing surrogate methods.

The paper tackled the problem of redundant sampling in batch-based active learning by introducing an algorithm that uses a Gaussian process surrogate to update for each new data instance, avoiding full model retraining. Experiments on four benchmark datasets showed that it yields significant enhancements, rivaling or aligning with state-of-the-art techniques.

Re-training a deep learning model each time a single data point receives a new label is impractical due to the inherent complexity of the training process. Consequently, existing active learning (AL) algorithms tend to adopt a batch-based approach where, during each AL iteration, a set of data points is collectively chosen for annotation. However, this strategy frequently leads to redundant sampling, ultimately eroding the efficacy of the labeling procedure. In this paper, we introduce a new AL algorithm that harnesses the power of a Gaussian process surrogate in conjunction with the neural network principal learner. Our proposed model adeptly updates the surrogate learner for every new data instance, enabling it to emulate and capitalize on the continuous learning dynamics of the neural network without necessitating a complete retraining of the principal model for each individual label. Experiments on four benchmark datasets demonstrate that this approach yields significant enhancements, either rivaling or aligning with the performance of state-of-the-art techniques.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes