CVAIMay 24, 2024

Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification

arXiv:2405.15860v1h-index: 8
Originality Incremental advance
AI Analysis

This addresses the problem of missing labels in multi-label image classification for researchers and practitioners, offering a plug-and-play solution that is incremental but effective.

The paper tackles the challenge of training deep classifiers on partially labeled multi-label image datasets by proposing LogicMix, a Mixup variant that uses logical OR to mix unknown labels and can mix multiple samples. The method achieves state-of-the-art performance on MS-COCO, VG-200, and Pascal VOC 2007 datasets through collaboration with other techniques.

Multi-label image classification datasets are often partially labeled where many labels are missing, posing a significant challenge to training accurate deep classifiers. However, the powerful Mixup sample-mixing data augmentation cannot be well utilized to address this challenge, as it cannot perform linear interpolation on the unknown labels to construct augmented samples. In this paper, we propose LogicMix, a Mixup variant designed for such partially labeled datasets. LogicMix mixes the sample labels by logical OR so that the unknown labels can be correctly mixed by utilizing OR's logical equivalences, including the domination and identity laws. Unlike Mixup, which mixes exactly two samples, LogicMix can mix multiple ($\geq2$) partially labeled samples, constructing visually more confused augmented samples to regularize training. LogicMix is more general and effective than other compared Mixup variants in the experiments on various partially labeled dataset scenarios. Moreover, it is plug-and-play and only requires minimal computation, hence it can be easily inserted into existing frameworks to collaborate with other methods to improve model performance with a negligible impact on training time, as demonstrated through extensive experiments. In particular, through the collaboration of LogicMix, RandAugment, Curriculum Labeling, and Category-wise Fine-Tuning, we attain state-of-the-art performance on MS-COCO, VG-200, and Pascal VOC 2007 benchmarking datasets. The remarkable generality, effectiveness, collaboration, and simplicity suggest that LogicMix promises to be a popular and vital data augmentation method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes