LACE: Loss-Adaptive Capacity Expansion for Continual Learning
This addresses the challenge for practitioners who must predefine model capacity without knowing data complexity, offering an incremental improvement for on-device continual learning under resource constraints.
The paper tackled the problem of fixed representational capacity in continual learning by proposing LACE, a loss-adaptive mechanism that expands model capacity online based on loss signals, achieving 100% boundary precision in expansions and matching the accuracy of larger fixed-capacity models while starting with fewer dimensions.
Fixed representational capacity is a fundamental constraint in continual learning: practitioners must guess an appropriate model width before training, without knowing how many distinct concepts the data contains. We propose LACE (Loss-Adaptive Capacity Expansion), a simple online mechanism that expands a model's representational capacity during training by monitoring its own loss signal. When sustained loss deviation exceeds a threshold - indicating that the current capacity is insufficient for newly encountered data - LACE adds new dimensions to the projection layer and trains them jointly with existing parameters. Across synthetic and real-data experiments, LACE triggers expansions exclusively at domain boundaries (100% boundary precision, zero false positives), matches the accuracy of a large fixed-capacity model while starting from a fraction of its dimensions, and produces adapter dimensions that are collectively critical to performance (3% accuracy drop when all adapters removed). We further demonstrate unsupervised domain separation in GPT-2 activations via layer-wise clustering, showing a U-shaped separability curve across layers that motivates adaptive capacity allocation in deep networks. LACE requires no labels, no replay buffers, and no external controllers, making it suitable for on-device continual learning under resource constraints.