LGDec 19, 2024

Balanced Gradient Sample Retrieval for Enhanced Knowledge Retention in Proxy-based Continual Learning

Hongye Xu, Jan Wasilewski, Bartosz Krawczyk

arXiv:2412.14430v12.6h-index: 3

Originality Incremental advance

AI Analysis

This addresses catastrophic forgetting in deep neural networks for continual learning applications, representing an incremental improvement over existing proxy-based methods.

The paper tackles catastrophic forgetting in continual learning by proposing a balanced gradient sample retrieval strategy that uses both gradient-conflicting and gradient-aligned samples to enhance knowledge retention. Empirical results show state-of-the-art performance on benchmarks, with improved retention and competitive accuracy on new tasks.

Continual learning in deep neural networks often suffers from catastrophic forgetting, where representations for previous tasks are overwritten during subsequent training. We propose a novel sample retrieval strategy from the memory buffer that leverages both gradient-conflicting and gradient-aligned samples to effectively retain knowledge about past tasks within a supervised contrastive learning framework. Gradient-conflicting samples are selected for their potential to reduce interference by re-aligning gradients, thereby preserving past task knowledge. Meanwhile, gradient-aligned samples are incorporated to reinforce stable, shared representations across tasks. By balancing gradient correction from conflicting samples with alignment reinforcement from aligned ones, our approach increases the diversity among retrieved instances and achieves superior alignment in parameter space, significantly enhancing knowledge retention and mitigating proxy drift. Empirical results demonstrate that using both sample types outperforms methods relying solely on one sample type or random retrieval. Experiments on popular continual learning benchmarks in computer vision validate our method's state-of-the-art performance in mitigating forgetting while maintaining competitive accuracy on new tasks.

View on arXiv PDF

Similar