LGNEMLJan 7, 2020

Frosting Weights for Better Continual Training

arXiv:2001.01829v15 citations
AI Analysis

This addresses the problem of maintaining model stability for practitioners in machine learning, though it appears incremental as it builds on existing ensemble methods.

The paper tackles catastrophic forgetting during continual training of neural networks by proposing two ensemble approaches, gradient boosting and meta-learning, to preserve performance when tuning pre-trained models.

Training a neural network model can be a lifelong learning process and is a computationally intensive one. A severe adverse effect that may occur in deep neural network models is that they can suffer from catastrophic forgetting during retraining on new data. To avoid such disruptions in the continuous learning, one appealing property is the additive nature of ensemble models. In this paper, we propose two generic ensemble approaches, gradient boosting and meta-learning, to solve the catastrophic forgetting problem in tuning pre-trained neural network models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes