LG CVApr 2, 2023

Adversary-Aware Partial label learning with Label distillation

arXiv:2304.00498v12.0h-index: 68

Originality Incremental advance

AI Analysis

This work addresses privacy-preserving machine learning for data collected from human subjects, offering a robust solution to a domain-specific challenge with incremental improvements over existing methods.

The paper tackles the problem of learning from noisy partial labels, where rival labels are introduced to protect participant privacy, by proposing an adversary-aware partial label learning method with an adversarial teacher within momentum disambiguation algorithm. The method achieves promising results on CIFAR10, CIFAR100, and CUB200 datasets, demonstrating high resiliency to label noise transition matrix choices.

To ensure that the data collected from human subjects is entrusted with a secret, rival labels are introduced to conceal the information provided by the participants on purpose. The corresponding learning task can be formulated as a noisy partial-label learning problem. However, conventional partial-label learning (PLL) methods are still vulnerable to the high ratio of noisy partial labels, especially in a large labelling space. To learn a more robust model, we present Adversary-Aware Partial Label Learning and introduce the $\textit{rival}$, a set of noisy labels, to the collection of candidate labels for each instance. By introducing the rival label, the predictive distribution of PLL is factorised such that a handy predictive label is achieved with less uncertainty coming from the transition matrix, assuming the rival generation process is known. Nonetheless, the predictive accuracy is still insufficient to produce an sufficiently accurate positive sample set to leverage the clustering effect of the contrastive loss function. Moreover, the inclusion of rivals also brings an inconsistency issue for the classifier and risk function due to the intractability of the transition matrix. Consequently, an adversarial teacher within momentum (ATM) disambiguation algorithm is proposed to cope with the situation, allowing us to obtain a provably consistent classifier and risk function. In addition, our method has shown high resiliency to the choice of the label noise transition matrix. Extensive experiments demonstrate that our method achieves promising results on the CIFAR10, CIFAR100 and CUB200 datasets.

View on arXiv PDF

Similar