CLLGJul 22, 2025

PromptAL: Sample-Aware Dynamic Soft Prompts for Few-Shot Active Learning

arXiv:2507.16424v14 citationsh-index: 2Has CodeKnowledge-Based Systems
Originality Incremental advance
AI Analysis

This work addresses the challenge of reducing annotation costs in few-shot active learning for machine learning practitioners, though it appears incremental as it builds on existing uncertainty and diversity methods.

The paper tackles the problem of suboptimal sample selection in few-shot active learning due to misaligned empirical and target distributions, and introduces PromptAL, a hybrid framework that uses sample-aware dynamic soft prompts to adjust the decision boundary and improve selection, achieving superior performance over nine baselines on six in-domain and three out-of-domain datasets.

Active learning (AL) aims to optimize model training and reduce annotation costs by selecting the most informative samples for labeling. Typically, AL methods rely on the empirical distribution of labeled data to define the decision boundary and perform uncertainty or diversity estimation, subsequently identifying potential high-quality samples. In few-shot scenarios, the empirical distribution often diverges significantly from the target distribution, causing the decision boundary to shift away from its optimal position. However, existing methods overlook the role of unlabeled samples in enhancing the empirical distribution to better align with the target distribution, resulting in a suboptimal decision boundary and the selection of samples that inadequately represent the target distribution. To address this, we propose a hybrid AL framework, termed \textbf{PromptAL} (Sample-Aware Dynamic Soft \textbf{Prompts} for Few-Shot \textbf{A}ctive \textbf{L}earning). This framework accounts for the contribution of each unlabeled data point in aligning the current empirical distribution with the target distribution, thereby optimizing the decision boundary. Specifically, PromptAL first leverages unlabeled data to construct sample-aware dynamic soft prompts that adjust the model's predictive distribution and decision boundary. Subsequently, based on the adjusted decision boundary, it integrates uncertainty estimation with both global and local diversity to select high-quality samples that more accurately represent the target distribution. Experimental results on six in-domain and three out-of-domain datasets show that PromptAL achieves superior performance over nine baselines. Our codebase is openly accessible.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes