CVLGApr 8

Holistic Optimal Label Selection for Robust Prompt Learning under Partial Labels

arXiv:2604.0661451.8h-index: 5
Predicted impact top 67% in CV · last 90 daysOriginality Incremental advance
AI Analysis

This work addresses a domain-specific issue for researchers and practitioners using prompt learning in weakly supervised vision-language tasks, offering a practical solution that is incremental in nature.

The paper tackled the problem of label ambiguity and insufficient supervision in prompt learning when only partial labels are available, proposing Holistic Optimal Label Selection (HopS) which improved performance on eight benchmark datasets and outperformed all baselines.

Prompt learning has gained significant attention as a parameter-efficient approach for adapting large pre-trained vision-language models to downstream tasks. However, when only partial labels are available, its performance is often limited by label ambiguity and insufficient supervisory information. To address this issue, we propose Holistic Optimal Label Selection (HopS), leveraging the generalization ability of pre-trained feature encoders through two complementary strategies. First, we design a local density-based filter that selects the top frequent labels from the nearest neighbors' candidate sets and uses the softmax scores to identify the most plausible label, capturing structural regularities in the feature space. Second, we introduce a global selection objective based on optimal transport that maps the uniform sampling distribution to the candidate label distributions across a batch. By minimizing the expected transport cost, it can determine the most likely label assignments. These two strategies work together to provide robust label selection from both local and global perspectives. Extensive experiments on eight benchmark datasets show that HopS consistently improves performance under partial supervision and outperforms all baselines. Those results highlight the merit of holistic label selection and offer a practical solution for prompt learning in weakly supervised settings.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes