CLMay 25, 2023

Overcoming Catastrophic Forgetting in Massively Multilingual Continual Learning

arXiv:2305.16252v1230 citations
Originality Incremental advance
AI Analysis

This addresses the problem of performance degradation in multilingual systems as they incorporate new languages over time, representing an incremental improvement in continual learning methods.

The paper tackled catastrophic forgetting in massively multilingual continual learning with up to 51 languages, and introduced LR ADJUST, a simple learning rate scheduling method that effectively preserves new information without strongly overwriting past knowledge across multiple tasks.

Real-life multilingual systems should be able to efficiently incorporate new languages as data distributions fed to the system evolve and shift over time. To do this, systems need to handle the issue of catastrophic forgetting, where the model performance drops for languages or tasks seen further in its past. In this paper, we study catastrophic forgetting, as well as methods to minimize this, in a massively multilingual continual learning framework involving up to 51 languages and covering both classification and sequence labeling tasks. We present LR ADJUST, a learning rate scheduling method that is simple, yet effective in preserving new information without strongly overwriting past knowledge. Furthermore, we show that this method is effective across multiple continual learning approaches. Finally, we provide further insights into the dynamics of catastrophic forgetting in this massively multilingual setup.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes