LGOct 20, 2024

Hybrid Memory Replay: Blending Real and Distilled Data for Class Incremental Learning

Jiangtao Kong, Jiacheng Shi, Ashley Gao, Shaohan Hu, Tianyi Zhou, Huajie Shao

arXiv:2410.15372v16.44 citationsh-index: 2

Originality Incremental advance

AI Analysis

This work addresses the challenge of limited buffer size in incremental learning for AI systems, offering an incremental improvement by optimizing hybrid memory to enhance knowledge retention.

The paper tackles the problem of catastrophic forgetting in class incremental learning by proposing a hybrid memory replay method that blends real and distilled synthetic data, achieving significant performance improvements over existing replay-based baselines across multiple benchmarks.

Incremental learning (IL) aims to acquire new knowledge from current tasks while retaining knowledge learned from previous tasks. Replay-based IL methods store a set of exemplars from previous tasks in a buffer and replay them when learning new tasks. However, there is usually a size-limited buffer that cannot store adequate real exemplars to retain the knowledge of previous tasks. In contrast, data distillation (DD) can reduce the exemplar buffer's size, by condensing a large real dataset into a much smaller set of more information-compact synthetic exemplars. Nevertheless, DD's performance gain on IL quickly vanishes as the number of synthetic exemplars grows. To overcome the weaknesses of real-data and synthetic-data buffers, we instead optimize a hybrid memory including both types of data. Specifically, we propose an innovative modification to DD that distills synthetic data from a sliding window of checkpoints in history (rather than checkpoints on multiple training trajectories). Conditioned on the synthetic data, we then optimize the selection of real exemplars to provide complementary improvement to the DD objective. The optimized hybrid memory combines the strengths of synthetic and real exemplars, effectively mitigating catastrophic forgetting in Class IL (CIL) when the buffer size for exemplars is limited. Notably, our method can be seamlessly integrated into most existing replay-based CIL models. Extensive experiments across multiple benchmarks demonstrate that our method significantly outperforms existing replay-based baselines.

View on arXiv PDF

Similar