CVMay 5

Dynamic Distillation and Gradient Consistency for Robust Long-Tailed Incremental Learning

arXiv:2605.033645.8
AI Analysis

For practitioners in continual learning with imbalanced data, this work provides a practical solution that improves robustness without significant computational overhead.

The paper tackles long-tailed class incremental learning, where class imbalance exacerbates catastrophic forgetting. The proposed method achieves up to 5.0% accuracy improvements on CIFAR-100-LT, ImageNetSubset-LT, and Food101-LT benchmarks, with dramatic gains in the challenging 'In-ordered' setting.

The task of Long-tailed Class Incremental Learning (LT-CIL) addresses the sequential learning of new classes from datasets with imbalanced class distributions. This scenario intensifies the fundamental problem of catastrophic forgetting, inherent to continual learning, with the dual challenges of under-learning minority classes and overfitting majority classes. To tackle these combined issues, this paper proposes two main techniques. First, we introduce gradient consistency regularization, which leverages the moving average of gradients to suppress abrupt fluctuations and stabilize the training process. Second, we dynamically adjust the weight of the distillation loss by measuring the degree of class imbalance with normalized entropy. This adaptive weighting establishes an optimal balance between retaining old knowledge and acquiring new information. Experiments on the CIFAR-100-LT, ImageNetSubset-LT, and Food101-LT benchmarks show that our method achieves consistent accuracy improvements of up to 5.0\%. Furthermore, we demonstrate dramatic gains in the challenging 'In-ordered' setting, where tasks progress from majority to minority classes, highlighting our method's robustness in mitigating forgetting under unfavorable learning dynamics. This enhanced performance is achieved without a significant increase in computational overhead, demonstrating the practicality of our framework.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes