LGCLFeb 13, 2024

Can LLMs Learn New Concepts Incrementally without Forgetting?

arXiv:2402.08526v32 citationsh-index: 12
Originality Incremental advance
AI Analysis

This addresses the challenge of enabling LLMs to learn new knowledge without forgetting, which is crucial for real-world applications, though it is incremental as it builds on existing IL benchmarks.

The study tackled the problem of catastrophic forgetting in large language models during incremental learning by introducing the Concept-1K dataset with 1,023 new concepts, revealing that models still suffer from forgetting and that methods like LoRA can exacerbate it.

Large Language Models (LLMs) have achieved remarkable success across various tasks, yet their ability to learn incrementally without forgetting remains underexplored. Incremental learning (IL) is crucial as it enables models to acquire new knowledge while retaining previously learned information, akin to human learning. Existing benchmarks for IL are insufficient due to data leakage issues and the overqualification of LLMs. To address these challenges, we introduce Concept-1K, a novel dataset comprising 1,023 recently emerged concepts across diverse domains. The concepts in Concept-1K are discrete, interpretable units of knowledge that allow for fine-grained analysis of learning and forgetting processes. Using Concept-1K as a testbed, we aim to answer the question: ``Can LLMs learn new concepts incrementally without forgetting like humans?'' Our investigation reveals that LLMs still suffer from catastrophic forgetting and that LoRA, despite fine-tuning fewer parameters, may lead to more forgetting on training data. Additionally, we explore the roles of in-context learning, model scale, buffer size, and pretraining in IL performance. These findings highlight the strengths and limitations of LLMs in IL scenarios and provide a robust benchmark for future research.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes