LG MLFeb 17, 2020

Residual Continual Learning

Janghyeon Lee, Donggyu Joo, Hyeong Gwon Hong, Junmo Kim

arXiv:2002.06774v112.025 citations

Originality Incremental advance

AI Analysis

This addresses the problem of forgetting in sequential tasks for AI systems, though it appears incremental as it builds on existing continual learning methods.

The paper tackles catastrophic forgetting in continual learning by introducing Residual Continual Learning (ResCL), which reparameterizes network parameters without increasing model size, and reports state-of-the-art performance across various scenarios.

We propose a novel continual learning method called Residual Continual Learning (ResCL). Our method can prevent the catastrophic forgetting phenomenon in sequential learning of multiple tasks, without any source task information except the original network. ResCL reparameterizes network parameters by linearly combining each layer of the original network and a fine-tuned network; therefore, the size of the network does not increase at all. To apply the proposed method to general convolutional neural networks, the effects of batch normalization layers are also considered. By utilizing residual-learning-like reparameterization and a special weight decay loss, the trade-off between source and target performance is effectively controlled. The proposed method exhibits state-of-the-art performance in various continual learning scenarios.

View on arXiv PDF

Similar