LGAICVAug 26, 2024

May the Forgetting Be with You: Alternate Replay for Learning with Noisy Labels

arXiv:2408.14284v15 citationsh-index: 20Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of learning from streaming data with noisy annotations, which is critical for real-world AI applications, though it is an incremental improvement on existing replay-based strategies.

The paper tackles the problem of Continual Learning with Noisy Labels by introducing Alternate Experience Replay (AER) with Asymmetric Balanced Sampling (ABS), achieving an average accuracy gain of 4.71% over existing methods.

Forgetting presents a significant challenge during incremental training, making it particularly demanding for contemporary AI systems to assimilate new knowledge in streaming data environments. To address this issue, most approaches in Continual Learning (CL) rely on the replay of a restricted buffer of past data. However, the presence of noise in real-world scenarios, where human annotation is constrained by time limitations or where data is automatically gathered from the web, frequently renders these strategies vulnerable. In this study, we address the problem of CL under Noisy Labels (CLN) by introducing Alternate Experience Replay (AER), which takes advantage of forgetting to maintain a clear distinction between clean, complex, and noisy samples in the memory buffer. The idea is that complex or mislabeled examples, which hardly fit the previously learned data distribution, are most likely to be forgotten. To grasp the benefits of such a separation, we equip AER with Asymmetric Balanced Sampling (ABS): a new sample selection strategy that prioritizes purity on the current task while retaining relevant samples from the past. Through extensive computational comparisons, we demonstrate the effectiveness of our approach in terms of both accuracy and purity of the obtained buffer, resulting in a remarkable average gain of 4.71% points in accuracy with respect to existing loss-based purification strategies. Code is available at https://github.com/aimagelab/mammoth.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes