CL AI LGDec 20, 2024

In-context Continual Learning Assisted by an External Continual Learner

Saleh Momeni, Sahisnu Mazumder, Zixuan Ke, Bing Liu

arXiv:2412.15563v113.822 citationsh-index: 17COLING

Originality Incremental advance

AI Analysis

This addresses scalability issues in continual learning for large language models, though it is incremental as it builds on existing in-context learning methods.

The paper tackles catastrophic forgetting in continual learning by integrating an external continual learner with in-context learning to pre-select classes and reduce prompt length, achieving significant performance gains over baselines.

Existing continual learning (CL) methods mainly rely on fine-tuning or adapting large language models (LLMs). They still suffer from catastrophic forgetting (CF). Little work has been done to exploit in-context learning (ICL) to leverage the extensive knowledge within LLMs for CL without updating any parameters. However, incrementally learning each new task in ICL necessitates adding training examples from each class of the task to the prompt, which hampers scalability as the prompt length increases. This issue not only leads to excessively long prompts that exceed the input token limit of the underlying LLM but also degrades the model's performance due to the overextended context. To address this, we introduce InCA, a novel approach that integrates an external continual learner (ECL) with ICL to enable scalable CL without CF. The ECL is built incrementally to pre-select a small subset of likely classes for each test instance. By restricting the ICL prompt to only these selected classes, InCA prevents prompt lengths from becoming excessively long, while maintaining high performance. Experimental results demonstrate that InCA significantly outperforms existing CL baselines, achieving substantial performance gains.

View on arXiv PDF

Similar