LGMLFeb 17, 2020

Residual Continual Learning

arXiv:2002.06774v125 citations
AI Analysis

This addresses the problem of forgetting in sequential tasks for AI systems, though it appears incremental as it builds on existing continual learning methods.

The paper tackles catastrophic forgetting in continual learning by introducing Residual Continual Learning (ResCL), which reparameterizes network parameters without increasing model size, and reports state-of-the-art performance across various scenarios.

We propose a novel continual learning method called Residual Continual Learning (ResCL). Our method can prevent the catastrophic forgetting phenomenon in sequential learning of multiple tasks, without any source task information except the original network. ResCL reparameterizes network parameters by linearly combining each layer of the original network and a fine-tuned network; therefore, the size of the network does not increase at all. To apply the proposed method to general convolutional neural networks, the effects of batch normalization layers are also considered. By utilizing residual-learning-like reparameterization and a special weight decay loss, the trade-off between source and target performance is effectively controlled. The proposed method exhibits state-of-the-art performance in various continual learning scenarios.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes