LGCVJan 19, 2025

Learning with Open-world Noisy Data via Class-independent Margin in Dual Representation Space

arXiv:2501.11053v12 citationsh-index: 5AAAI
Originality Incremental advance
AI Analysis

This addresses a real-world limitation in noisy label learning for machine learning practitioners, moving beyond closed-set assumptions to handle unknown classes, though it is incremental in extending existing frameworks.

The paper tackles the problem of learning with noisy labels in open-world scenarios where unknown classes introduce open-set noise, proposing a dual-space joint learning method that improves average accuracy by 4.55% and AUROC by 6.17% on CIFAR80N.

Learning with Noisy Labels (LNL) aims to improve the model generalization when facing data with noisy labels, and existing methods generally assume that noisy labels come from known classes, called closed-set noise. However, in real-world scenarios, noisy labels from similar unknown classes, i.e., open-set noise, may occur during the training and inference stage. Such open-world noisy labels may significantly impact the performance of LNL methods. In this study, we propose a novel dual-space joint learning method to robustly handle the open-world noise. To mitigate model overfitting on closed-set and open-set noises, a dual representation space is constructed by two networks. One is a projection network that learns shared representations in the prototype space, while the other is a One-Vs-All (OVA) network that makes predictions using unique semantic representations in the class-independent space. Then, bi-level contrastive learning and consistency regularization are introduced in two spaces to enhance the detection capability for data with unknown classes. To benefit from the memorization effects across different types of samples, class-independent margin criteria are designed for sample identification, which selects clean samples, weights closed-set noise, and filters open-set noise effectively. Extensive experiments demonstrate that our method outperforms the state-of-the-art methods and achieves an average accuracy improvement of 4.55\% and an AUROC improvement of 6.17\% on CIFAR80N.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes