CV · Sep 27, 2024

LW2G: Learning Whether to Grow for Prompt-based Continual Learning

Qian Feng, Da-wei Zhou, Hanbin Zhao, Chao Zhang, Jiahua Dong, Dengxin Dai, Hui Qian

arXiv:2409.18860v210.510 citations · h-index: 21 · Has Code

Originality: Incremental advance

AI Analysis

This work addresses challenges in continual learning for AI systems that need to adapt to new tasks without forgetting old ones, offering an incremental improvement over existing prompt-based methods.

The paper tackles the problem of unbounded prompt pool growth and low selection accuracy in prompt-based continual learning by proposing LW2G, a plug-in method that uses a metric called Hinder Forward Capability to dynamically decide when to add new prompt sets, achieving intra-task knowledge sharing and improved efficiency.

Recent prompt-based continual learning (PCL) has achieved remarkable performance with pre-trained models. These approaches expand a prompt pool by adding a new set of prompts during learning and select the correct set during inference. Previous studies have revealed that learning task-wise prompt sets individually and low selection accuracy both pose challenges to the performance of PCL. In this paper, we propose a plug-in method, Learning Whether to Grow (LW2G), which leverages the disparities between tasks to form an effective and efficient pool of prompt sets, thereby achieving intra-task knowledge sharing and cooperation and avoiding an unbounded increase in the cost of the prompt pool. Specifically, a shared set is used when several tasks share certain commonalities, and a new set is added when there are significant differences between the new and previous tasks. To achieve this, we develop a metric called Hinder Forward Capability (HFC), which measures the hindrance imposed on learning new tasks when the original gradient is projected onto the orthogonal complement of the old feature space. With HFC, an automated scheme, the Dynamic Growing Approach, adaptively learns whether to grow using a dynamic threshold. Furthermore, we design a gradient-based constraint to ensure consistency between the updated prompts and pre-trained knowledge. Extensive experiments show the effectiveness of our method. Code is available at https://github.com/RAIAN08/LW2G.
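
The gradient projection behind HFC can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract does not give the exact formula, so the sketch assumes the hindrance is measured as the angle between the original gradient and its projection onto the orthogonal complement of the old feature space. The function names, the fixed threshold (a stand-in for the paper's dynamic threshold), and the rule that a larger angle triggers growth are all hypothetical.

```python
import numpy as np


def project_to_orthogonal_complement(grad, old_features, tol=1e-10):
    """Remove from `grad` its component inside span(columns of old_features)."""
    # Orthonormal basis of the old feature space from the left singular vectors.
    u, s, _ = np.linalg.svd(old_features, full_matrices=False)
    rank = int(np.sum(s > tol * s.max())) if s.size else 0
    basis = u[:, :rank]
    return grad - basis @ (basis.T @ grad)


def hindrance_angle(grad, old_features):
    """Angle in degrees between the gradient and its projected version.

    Assumption: this angle serves as a stand-in for the HFC metric.
    """
    projected = project_to_orthogonal_complement(grad, old_features)
    norm_g = np.linalg.norm(grad)
    norm_p = np.linalg.norm(projected)
    if norm_g < 1e-12 or norm_p < 1e-12:
        # The gradient lies entirely in the old feature space (or is zero).
        return 90.0
    cosine = np.clip(projected @ grad / (norm_g * norm_p), -1.0, 1.0)
    return float(np.degrees(np.arccos(cosine)))


def should_grow(grad, old_features, threshold_deg=30.0):
    """Hypothetical decision rule: add a new prompt set when hindrance is large."""
    return hindrance_angle(grad, old_features) > threshold_deg


# Toy example: the old feature space is the first coordinate axis of R^3.
old = np.array([[1.0], [0.0], [0.0]])
g = np.array([1.0, 1.0, 0.0])
print(hindrance_angle(g, old), should_grow(g, old))
```

In this toy case the projection strips the first coordinate of the gradient, so a gradient that overlaps heavily with the old feature space is rotated more and is treated as more strongly hindered.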
