LGAIMLOct 29, 2018

Learning to Learn without Forgetting by Maximizing Transfer and Minimizing Interference

arXiv:1810.11910v3920 citations
Originality Incremental advance
AI Analysis

This addresses the problem of catastrophic forgetting in neural networks for researchers and practitioners in machine learning, representing an incremental improvement over existing continual learning methods.

The paper tackles the challenge of continual learning over non-stationary data by proposing a method that optimizes the trade-off between transfer and interference through gradient alignment, resulting in consistent outperformance of baselines in benchmarks, with performance gaps increasing in more non-stationary environments and with smaller stored experience fractions.

Lack of performance when it comes to continual learning over non-stationary distributions of data remains a major challenge in scaling neural network learning to more human realistic settings. In this work we propose a new conceptualization of the continual learning problem in terms of a temporally symmetric trade-off between transfer and interference that can be optimized by enforcing gradient alignment across examples. We then propose a new algorithm, Meta-Experience Replay (MER), that directly exploits this view by combining experience replay with optimization based meta-learning. This method learns parameters that make interference based on future gradients less likely and transfer based on future gradients more likely. We conduct experiments across continual lifelong supervised learning benchmarks and non-stationary reinforcement learning environments demonstrating that our approach consistently outperforms recently proposed baselines for continual learning. Our experiments show that the gap between the performance of MER and baseline algorithms grows both as the environment gets more non-stationary and as the fraction of the total experiences stored gets smaller.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes