LGDec 10, 2021

Boosting Active Learning via Improving Test Performance

Tianyang Wang, Xingjian Li, Pengkun Yang, Guosheng Hu, Xiangrui Zeng, Siyu Huang, Cheng-Zhong Xu, Min Xu

arXiv:2112.05683v213.643 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the core challenge in active learning of data selection for practitioners, though it appears incremental as it builds on existing gradient-based approaches.

The authors tackled the problem of selecting which unlabeled data to annotate in active learning by proving that selecting data with higher gradient norm leads to better test performance, and they proposed two practical schemes (expected-gradnorm and entropy-gradnorm) to implement this. Their method achieved superior performance against state-of-the-art methods on image classification, semantic segmentation, and a cellular imaging task.

Central to active learning (AL) is what data should be selected for annotation. Existing works attempt to select highly uncertain or informative data for annotation. Nevertheless, it remains unclear how selected data impacts the test performance of the task model used in AL. In this work, we explore such an impact by theoretically proving that selecting unlabeled data of higher gradient norm leads to a lower upper-bound of test loss, resulting in better test performance. However, due to the lack of label information, directly computing gradient norm for unlabeled data is infeasible. To address this challenge, we propose two schemes, namely expected-gradnorm and entropy-gradnorm. The former computes the gradient norm by constructing an expected empirical loss while the latter constructs an unsupervised loss with entropy. Furthermore, we integrate the two schemes in a universal AL framework. We evaluate our method on classical image classification and semantic segmentation tasks. To demonstrate its competency in domain applications and its robustness to noise, we also validate our method on a cellular imaging analysis task, namely cryo-Electron Tomography subtomogram classification. Results demonstrate that our method achieves superior performance against the state of the art. Our source code is available at https://github.com/xulabs/aitom/blob/master/doc/projects/al_gradnorm.md.

View on arXiv PDF Code

Similar