LG AIFeb 20, 2025

Correcting Noisy Multilabel Predictions: Modeling Label Noise through Latent Space Shifts

Weipeng Huang, Qin Li, Yang Xiao, Cheng Qiao, Tie Cai, Junwei Liang, Neil J. Hurley, Guangyuan Piao

arXiv:2502.14281v37.12 citationsh-index: 13Has Code

Originality Incremental advance

AI Analysis

This addresses the issue of label noise in multilabel classification, which is less explored than multiclass, offering a computationally efficient solution that can be combined with other techniques, though it appears incremental in scope.

The paper tackles the problem of noisy labels in multilabel classification by proposing a post-correction method that models label noise as latent space shifts, achieving consistent improvements over independent models and outperforming existing methods across various noisy settings.

Noise in data appears to be inevitable in most real-world machine learning applications and would cause severe overfitting problems. Not only can data features contain noise, but labels are also prone to be noisy due to human input. In this paper, rather than noisy label learning in multiclass classifications, we instead focus on the less explored area of noisy label learning for multilabel classifications. Specifically, we investigate the post-correction of predictions generated from classifiers learned with noisy labels. The reasons are two-fold. Firstly, this approach can directly work with the trained models to save computational resources. Secondly, it could be applied on top of other noisy label correction techniques to achieve further improvements. To handle this problem, we appeal to deep generative approaches that are possible for uncertainty estimation. Our model posits that label noise arises from a stochastic shift in the latent variable, providing a more robust and beneficial means for noisy learning. We develop both unsupervised and semi-supervised learning methods for our model. The extensive empirical study presents solid evidence to that our approach is able to consistently improve the independent models and performs better than a number of existing methods across various noisy label settings. Moreover, a comprehensive empirical analysis of the proposed method is carried out to validate its robustness, including sensitivity analysis and an ablation study, among other elements.

View on arXiv PDF Code

Similar