CVMay 22, 2021

PLM: Partial Label Masking for Imbalanced Multi-label Classification

Kevin Duarte, Yogesh S. Rawat, Mubarak Shah

arXiv:2105.10782v17.324 citations

Originality Highly original

AI Analysis

This work addresses data imbalance in multi-label classification, a domain where existing methods often fail, offering a versatile solution that can be combined with other strategies.

The paper tackles the problem of neural network bias towards frequent classes in imbalanced multi-label classification by proposing Partial Label Masking (PLM), which adaptively masks labels during training to balance class ratios, resulting in improved recall on minority classes and precision on frequent classes on datasets like MultiMNIST and MSCOCO.

Neural networks trained on real-world datasets with long-tailed label distributions are biased towards frequent classes and perform poorly on infrequent classes. The imbalance in the ratio of positive and negative samples for each class skews network output probabilities further from ground-truth distributions. We propose a method, Partial Label Masking (PLM), which utilizes this ratio during training. By stochastically masking labels during loss computation, the method balances this ratio for each class, leading to improved recall on minority classes and improved precision on frequent classes. The ratio is estimated adaptively based on the network's performance by minimizing the KL divergence between predicted and ground-truth distributions. Whereas most existing approaches addressing data imbalance are mainly focused on single-label classification and do not generalize well to the multi-label case, this work proposes a general approach to solve the long-tail data imbalance issue for multi-label classification. PLM is versatile: it can be applied to most objective functions and it can be used alongside other strategies for class imbalance. Our method achieves strong performance when compared to existing methods on both multi-label (MultiMNIST and MSCOCO) and single-label (imbalanced CIFAR-10 and CIFAR-100) image classification datasets.

View on arXiv PDF

Similar