LGAICVJul 3, 2021

Split-and-Bridge: Adaptable Class Incremental Learning within a Single Neural Network

arXiv:2107.01349v128 citations
Originality Incremental advance
AI Analysis

This addresses the issue of catastrophic forgetting in continual learning for AI systems that need to adapt to new tasks over time, representing an incremental improvement over existing knowledge distillation approaches.

The paper tackles the problem of class incremental learning in continual learning, where existing methods using knowledge distillation and cross-entropy loss suffer from competitive interference within a single network. The proposed Split-and-Bridge method partially splits and reconnects the network, outperforming state-of-the-art competitors in KD-based continual learning.

Continual learning has been a major problem in the deep learning community, where the main challenge is how to effectively learn a series of newly arriving tasks without forgetting the knowledge of previous tasks. Initiated by Learning without Forgetting (LwF), many of the existing works report that knowledge distillation is effective to preserve the previous knowledge, and hence they commonly use a soft label for the old task, namely a knowledge distillation (KD) loss, together with a class label for the new task, namely a cross entropy (CE) loss, to form a composite loss for a single neural network. However, this approach suffers from learning the knowledge by a CE loss as a KD loss often more strongly influences the objective function when they are in a competitive situation within a single network. This could be a critical problem particularly in a class incremental scenario, where the knowledge across tasks as well as within the new task, both of which can only be acquired by a CE loss, is essentially learned due to the existence of a unified classifier. In this paper, we propose a novel continual learning method, called Split-and-Bridge, which can successfully address the above problem by partially splitting a neural network into two partitions for training the new task separated from the old task and re-connecting them for learning the knowledge across tasks. In our thorough experimental analysis, our Split-and-Bridge method outperforms the state-of-the-art competitors in KD-based continual learning.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes