LGMay 14, 2025

Exploiting the Potential Supervision Information of Clean Samples in Partial Label Learning

arXiv:2505.09354v1h-index: 3Pattern Recognition
Originality Incremental advance
AI Analysis

This work addresses disambiguation challenges in partial label learning, offering an incremental improvement by leveraging clean samples for better guidance.

The paper tackles the problem of disambiguation in partial label learning by exploiting clean samples to enhance candidate confidence, resulting in performance improvements across multiple state-of-the-art methods on synthetic and real-world datasets.

Diminishing the impact of false-positive labels is critical for conducting disambiguation in partial label learning. However, the existing disambiguation strategies mainly focus on exploiting the characteristics of individual partial label instances while neglecting the strong supervision information of clean samples randomly lying in the datasets. In this work, we show that clean samples can be collected to offer guidance and enhance the confidence of the most possible candidates. Motivated by the manner of the differentiable count loss strat- egy and the K-Nearest-Neighbor algorithm, we proposed a new calibration strategy called CleanSE. Specifically, we attribute the most reliable candidates with higher significance under the assumption that for each clean sample, if its label is one of the candidates of its nearest neighbor in the representation space, it is more likely to be the ground truth of its neighbor. Moreover, clean samples offer help in characterizing the sample distributions by restricting the label counts of each label to a specific interval. Extensive experiments on 3 synthetic benchmarks and 5 real-world PLL datasets showed this calibration strategy can be applied to most of the state-of-the-art PLL methods as well as enhance their performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes