CL AI LGSep 8, 2021

Active Learning by Acquiring Contrastive Examples

Katerina Margatina, Giorgos Vernikos, Loïc Barrault, Nikolaos Aletras

arXiv:2109.03764v132.3694 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the challenge of efficiently selecting informative data points for labeling in active learning, which is incremental as it combines uncertainty and diversity sampling.

The paper tackled the problem of active learning by proposing an acquisition function that selects contrastive examples, which are data points similar in feature space but with maximally different predictive likelihoods. The result showed that CAL performed consistently better or equal to the best baselines across four natural language understanding tasks and seven datasets, including in-domain and out-of-domain data.

Common acquisition functions for active learning use either uncertainty or diversity sampling, aiming to select difficult and diverse data points from the pool of unlabeled data, respectively. In this work, leveraging the best of both worlds, we propose an acquisition function that opts for selecting \textit{contrastive examples}, i.e. data points that are similar in the model feature space and yet the model outputs maximally different predictive likelihoods. We compare our approach, CAL (Contrastive Active Learning), with a diverse set of acquisition functions in four natural language understanding tasks and seven datasets. Our experiments show that CAL performs consistently better or equal than the best performing baseline across all tasks, on both in-domain and out-of-domain data. We also conduct an extensive ablation study of our method and we further analyze all actively acquired datasets showing that CAL achieves a better trade-off between uncertainty and diversity compared to other strategies.

View on arXiv PDF Code

Similar