LGCVJul 30, 2021

When Deep Learners Change Their Mind: Learning Dynamics for Active Learning

arXiv:2107.14707v111 citations
Originality Incremental advance
AI Analysis

This work addresses active learning for machine learning practitioners by improving sample selection, but it is incremental as it builds on existing informativeness-based methods.

The paper tackles the problem of active learning by addressing neural networks' overconfidence, proposing a method based on learning dynamics called label-dispersion to measure sample informativeness, and shows it achieves excellent results on two benchmark datasets.

Active learning aims to select samples to be annotated that yield the largest performance improvement for the learning algorithm. Many methods approach this problem by measuring the informativeness of samples and do this based on the certainty of the network predictions for samples. However, it is well-known that neural networks are overly confident about their prediction and are therefore an untrustworthy source to assess sample informativeness. In this paper, we propose a new informativeness-based active learning method. Our measure is derived from the learning dynamics of a neural network. More precisely we track the label assignment of the unlabeled data pool during the training of the algorithm. We capture the learning dynamics with a metric called label-dispersion, which is low when the network consistently assigns the same label to the sample during the training of the network and high when the assigned label changes frequently. We show that label-dispersion is a promising predictor of the uncertainty of the network, and show on two benchmark datasets that an active learning algorithm based on label-dispersion obtains excellent results.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes