LGAIFeb 20, 2022

Continual Learning Beyond a Single Model

arXiv:2202.09826v324 citations
Originality Incremental advance
AI Analysis

This work addresses the computational inefficiency of ensembles in continual learning, offering a practical solution for researchers and practitioners dealing with sequential tasks.

The paper tackles the catastrophic forgetting problem in continual learning by questioning the assumption of using a single model and shows that ensemble models improve performance, but proposes a computationally cheap algorithm based on neural network subspaces to reduce costs while maintaining benefits.

A growing body of research in continual learning focuses on the catastrophic forgetting problem. While many attempts have been made to alleviate this problem, the majority of the methods assume a single model in the continual learning setup. In this work, we question this assumption and show that employing ensemble models can be a simple yet effective method to improve continual performance. However, ensembles' training and inference costs can increase significantly as the number of models grows. Motivated by this limitation, we study different ensemble models to understand their benefits and drawbacks in continual learning scenarios. Finally, to overcome the high compute cost of ensembles, we leverage recent advances in neural network subspace to propose a computationally cheap algorithm with similar runtime to a single model yet enjoying the performance benefits of ensembles.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes