LG MLJun 1, 2022

Transfer without Forgetting

Matteo Boschini, Lorenzo Bonicelli, Angelo Porrello, Giovanni Bellitto, Matteo Pennisi, Simone Palazzo, Concetto Spampinato, Simone Calderara

arXiv:2206.00388v223.869 citationsh-index: 41Has Code

Originality Incremental advance

AI Analysis

This addresses the issue of under-exploiting knowledge transfer in continual learning for AI practitioners, representing an incremental improvement over existing methods.

The paper tackles the problem of catastrophic forgetting in network pretraining during continual learning, proposing Transfer without Forgetting (TwF) which achieves an average 4.81% gain in Class-Incremental accuracy across various datasets and buffer sizes.

This work investigates the entanglement between Continual Learning (CL) and Transfer Learning (TL). In particular, we shed light on the widespread application of network pretraining, highlighting that it is itself subject to catastrophic forgetting. Unfortunately, this issue leads to the under-exploitation of knowledge transfer during later tasks. On this ground, we propose Transfer without Forgetting (TwF), a hybrid approach building upon a fixed pretrained sibling network, which continuously propagates the knowledge inherent in the source domain through a layer-wise loss term. Our experiments indicate that TwF steadily outperforms other CL methods across a variety of settings, averaging a 4.81% gain in Class-Incremental accuracy over a variety of datasets and different buffer sizes.

View on arXiv PDF Code

Similar