LGAIAug 11, 2025

On Understanding of the Dynamics of Model Capacity in Continual Learning

arXiv:2508.08052v2h-index: 3
AI Analysis

This addresses the fundamental challenge of balancing stability and plasticity in continual learning, which is crucial for AI systems that learn sequentially, though it is incremental as it builds on existing theory.

The paper tackles the stability-plasticity dilemma in continual learning by introducing effective model capacity (CLEMC) to characterize its dynamic behavior, showing that a neural network's ability to represent new tasks diminishes when task distributions differ, with experiments across architectures from small networks to large language models.

The stability-plasticity dilemma, closely related to a neural network's (NN) capacity-its ability to represent tasks-is a fundamental challenge in continual learning (CL). Within this context, we introduce CL's effective model capacity (CLEMC) that characterizes the dynamic behavior of the stability-plasticity balance point. We develop a difference equation to model the evolution of the interplay between the NN, task data, and optimization procedure. We then leverage CLEMC to demonstrate that the effective capacity-and, by extension, the stability-plasticity balance point is inherently non-stationary. We show that regardless of the NN architecture or optimization method, a NN's ability to represent new tasks diminishes when incoming task distributions differ from previous ones. We conduct extensive experiments to support our theoretical findings, spanning a range of architectures-from small feedforward network and convolutional networks to medium-sized graph neural networks and transformer-based large language models with millions of parameters.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes