LGMay 26, 2022

Deep Active Learning with Noise Stability

Xingjian Li, Pengkun Yang, Yangcheng Gu, Xueying Zhan, Tianyang Wang, Min Xu, Chengzhong Xu

arXiv:2205.13340v213.030 citationsh-index: 60

Originality Incremental advance

AI Analysis

This addresses the problem of inefficient and complex pipelines in active learning for practitioners in fields like computer vision and NLP, though it appears incremental as it builds on existing uncertainty estimation methods.

The paper tackles the challenge of uncertainty estimation for unlabeled data in deep active learning by proposing a novel algorithm that uses noise stability to measure output derivation under parameter perturbations, achieving competitive performance against state-of-the-art baselines.

Uncertainty estimation for unlabeled data is crucial to active learning. With a deep neural network employed as the backbone model, the data selection process is highly challenging due to the potential over-confidence of the model inference. Existing methods resort to special learning fashions (e.g. adversarial) or auxiliary models to address this challenge. This tends to result in complex and inefficient pipelines, which would render the methods impractical. In this work, we propose a novel algorithm that leverages noise stability to estimate data uncertainty. The key idea is to measure the output derivation from the original observation when the model parameters are randomly perturbed by noise. We provide theoretical analyses by leveraging the small Gaussian noise theory and demonstrate that our method favors a subset with large and diverse gradients. Our method is generally applicable in various tasks, including computer vision, natural language processing, and structural data analysis. It achieves competitive performance compared against state-of-the-art active learning baselines.

View on arXiv PDF

Similar