LGMLJun 1, 2022

Transfer without Forgetting

arXiv:2206.00388v269 citationsh-index: 41
Originality Incremental advance
AI Analysis

This addresses the issue of under-exploiting knowledge transfer in continual learning for AI practitioners, representing an incremental improvement over existing methods.

The paper tackles the problem of catastrophic forgetting in network pretraining during continual learning, proposing Transfer without Forgetting (TwF) which achieves an average 4.81% gain in Class-Incremental accuracy across various datasets and buffer sizes.

This work investigates the entanglement between Continual Learning (CL) and Transfer Learning (TL). In particular, we shed light on the widespread application of network pretraining, highlighting that it is itself subject to catastrophic forgetting. Unfortunately, this issue leads to the under-exploitation of knowledge transfer during later tasks. On this ground, we propose Transfer without Forgetting (TwF), a hybrid approach building upon a fixed pretrained sibling network, which continuously propagates the knowledge inherent in the source domain through a layer-wise loss term. Our experiments indicate that TwF steadily outperforms other CL methods across a variety of settings, averaging a 4.81% gain in Class-Incremental accuracy over a variety of datasets and different buffer sizes.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes