LGCLMay 16, 2023

On Dataset Transferability in Active Learning for Transformers

arXiv:2305.09807v2224 citations
Originality Incremental advance
AI Analysis

This addresses the problem of labeling cost efficiency for NLP practitioners by showing that active learning datasets can be reused across models, though it is incremental in scope.

The study investigated whether active learning gains for one transformer model transfer to another in text classification, finding that datasets built with similar acquisition sequences are highly transferable regardless of the model used.

Active learning (AL) aims to reduce labeling costs by querying the examples most beneficial for model learning. While the effectiveness of AL for fine-tuning transformer-based pre-trained language models (PLMs) has been demonstrated, it is less clear to what extent the AL gains obtained with one model transfer to others. We consider the problem of transferability of actively acquired datasets in text classification and investigate whether AL gains persist when a dataset built using AL coupled with a specific PLM is used to train a different PLM. We link the AL dataset transferability to the similarity of instances queried by the different PLMs and show that AL methods with similar acquisition sequences produce highly transferable datasets regardless of the models used. Additionally, we show that the similarity of acquisition sequences is influenced more by the choice of the AL method than the choice of the model.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes