CLFeb 18, 2024

Mitigating Catastrophic Forgetting in Multi-domain Chinese Spelling Correction by Multi-stage Knowledge Transfer Framework

arXiv:2402.11422v15 citationsh-index: 19
Originality Incremental advance
AI Analysis

This addresses a key flaw in CSC models for practical multi-domain applications, though it is incremental as it applies continual learning methods to this specific task.

The paper tackles catastrophic forgetting in multi-domain Chinese Spelling Correction (CSC) by proposing a Multi-stage Knowledge Transfer (MKT) framework, which improves model performance by mitigating knowledge loss when adapting to new domains.

Chinese Spelling Correction (CSC) aims to detect and correct spelling errors in given sentences. Recently, multi-domain CSC has gradually attracted the attention of researchers because it is more practicable. In this paper, we focus on the key flaw of the CSC model when adapting to multi-domain scenarios: the tendency to forget previously acquired knowledge upon learning new domain-specific knowledge (i.e., catastrophic forgetting). To address this, we propose a novel model-agnostic Multi-stage Knowledge Transfer (MKT) framework, which utilizes a continuously evolving teacher model for knowledge transfer in each domain, rather than focusing solely on new domain knowledge. It deserves to be mentioned that we are the first to apply continual learning methods to the multi-domain CSC task. Experiments prove the effectiveness of our proposed method, and further analyses demonstrate the importance of overcoming catastrophic forgetting for improving the model performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes