LGAICVApr 11, 2023

Task Difficulty Aware Parameter Allocation & Regularization for Lifelong Learning

arXiv:2304.05288v119 citationsh-index: 20
Originality Incremental advance
AI Analysis

This addresses the challenge of efficient lifelong learning for AI systems by adapting to task differences, though it is incremental as it builds on existing parameter methods.

The paper tackles the problem of catastrophic forgetting in lifelong learning by proposing a method that adaptively selects parameter allocation or regularization strategies based on task difficulty, resulting in reduced model redundancy and improved performance on multiple benchmarks.

Parameter regularization or allocation methods are effective in overcoming catastrophic forgetting in lifelong learning. However, they solve all tasks in a sequence uniformly and ignore the differences in the learning difficulty of different tasks. So parameter regularization methods face significant forgetting when learning a new task very different from learned tasks, and parameter allocation methods face unnecessary parameter overhead when learning simple tasks. In this paper, we propose the Parameter Allocation & Regularization (PAR), which adaptively select an appropriate strategy for each task from parameter allocation and regularization based on its learning difficulty. A task is easy for a model that has learned tasks related to it and vice versa. We propose a divergence estimation method based on the Nearest-Prototype distance to measure the task relatedness using only features of the new task. Moreover, we propose a time-efficient relatedness-aware sampling-based architecture search strategy to reduce the parameter overhead for allocation. Experimental results on multiple benchmarks demonstrate that, compared with SOTAs, our method is scalable and significantly reduces the model's redundancy while improving the model's performance. Further qualitative analysis indicates that PAR obtains reasonable task-relatedness.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes