LG AI CVMay 18, 2025

Scalable Strategies for Continual Learning with Replay

arXiv:2505.12512v111.43 citationsh-index: 1

Originality Incremental advance

AI Analysis

This work addresses the problem of inefficient replay in continual learning for AI systems, offering incremental improvements by integrating scalable techniques like low rank adaptation and model merging.

The paper tackles the scalability challenge in continual learning with replay by introducing consolidation to reduce replay samples by up to 55% and sequential merging, resulting in a synergistic toolset that outperforms standalone methods.

Future deep learning models will be distinguished by systems that perpetually learn through interaction, imagination, and cooperation, blurring the line between training and inference. This makes continual learning a critical challenge, as methods that efficiently maximize bidirectional transfer across learning trajectories will be essential. Replay is on track to play a foundational role in continual learning, allowing models to directly reconcile new information with past knowledge. In practice, however, replay is quite unscalable, doubling the cost of continual learning when applied naively. Moreover, the continual learning literature has not fully synchronized with the multi-task fine-tuning literature, having not fully integrated highly scalable techniques like model merging and low rank adaptation into a replay-enabled toolset that can produce a unified model in the face of many sequential tasks. In this paper, we begin by applying and analyzing low rank adaptation in a continual learning setting. Next, we introduce consolidation, a phasic approach to replay which leads to up to 55\% less replay samples being needed for a given performance target. Then, we propose sequential merging, an offshoot of task arithmetic which is tailored to the continual learning setting and is shown to work well in combination with replay. Finally, we demonstrate that the developed strategies can operate synergistically, resulting in a highly scalable toolset that outperforms standalone variants.

View on arXiv PDF

Similar