Essentials for Class Incremental Learning
This addresses the problem of neural networks forgetting old tasks when learning new ones, which is crucial for real-world applications with evolving data, though it is incremental as it builds on existing methods.
The paper tackled catastrophic forgetting in class-incremental learning by showing that a simple combination of components and a balanced loss resolves forgetting as effectively as complex methods, and identified representation quality as a key factor, improving state-of-the-art results on CIFAR-100 and ImageNet by a large margin.
Contemporary neural networks are limited in their ability to learn from evolving streams of training data. When trained sequentially on new or evolving tasks, their accuracy drops sharply, making them unsuitable for many real-world applications. In this work, we shed light on the causes of this well-known yet unsolved phenomenon - often referred to as catastrophic forgetting - in a class-incremental setup. We show that a combination of simple components and a loss that balances intra-task and inter-task learning can already resolve forgetting to the same extent as more complex measures proposed in literature. Moreover, we identify poor quality of the learned representation as another reason for catastrophic forgetting in class-IL. We show that performance is correlated with secondary class information (dark knowledge) learned by the model and it can be improved by an appropriate regularizer. With these lessons learned, class-incremental learning results on CIFAR-100 and ImageNet improve over the state-of-the-art by a large margin, while keeping the approach simple.