LG AIJul 19, 2025

GRID: Scalable Task-Agnostic Prompt-Based Continual Learning for Language Models

Anushka Tiwari, Sayantan Pal, Rohini K. Srihari, Kaiyi Ji

arXiv:2507.14725v37.11 citationsh-index: 35

Originality Incremental advance

AI Analysis

This addresses scalability issues in continual learning for language models, though it is incremental as it builds on existing prompt-based methods.

The paper tackles the problem of performance degradation and memory inefficiency in prompt-based continual learning for large language models by introducing GRID, a framework that improves average accuracy and backward transfer while reducing prompt memory usage.

Prompt-based continual learning (CL) provides a parameter-efficient approach for adapting large language models (LLMs) across task sequences. However, most existing methods rely on task-aware inference and maintain a growing set of task-specific prompts, which introduces two major challenges: (1) severe performance degradation on earlier tasks under task-agnostic inference, and (2) limited scalability due to prompt memory accumulation as task sequences grow. In this paper, we present GRID, a unified framework designed to address these challenges. GRID incorporates a decoding mechanism that enhances backward transfer by leveraging representative inputs, automatic task identification, and constrained decoding. Furthermore, it employs a gradient-guided prompt selection strategy to compress less informative prompts into a single aggregated representation, ensuring scalable and memory-efficient continual learning. Extensive experiments on long-sequence and negative transfer benchmarks show that GRID improves average accuracy and backward transfer, achieves competitive forward transfer, and substantially reduces prompt memory usage.

View on arXiv PDF

Similar