LGAIJul 19, 2025

GRID: Scalable Task-Agnostic Prompt-Based Continual Learning for Language Models

arXiv:2507.14725v31 citationsh-index: 35
Originality Incremental advance
AI Analysis

This addresses scalability issues in continual learning for language models, though it is incremental as it builds on existing prompt-based methods.

The paper tackles the problem of performance degradation and memory inefficiency in prompt-based continual learning for large language models by introducing GRID, a framework that improves average accuracy and backward transfer while reducing prompt memory usage.

Prompt-based continual learning (CL) provides a parameter-efficient approach for adapting large language models (LLMs) across task sequences. However, most existing methods rely on task-aware inference and maintain a growing set of task-specific prompts, which introduces two major challenges: (1) severe performance degradation on earlier tasks under task-agnostic inference, and (2) limited scalability due to prompt memory accumulation as task sequences grow. In this paper, we present GRID, a unified framework designed to address these challenges. GRID incorporates a decoding mechanism that enhances backward transfer by leveraging representative inputs, automatic task identification, and constrained decoding. Furthermore, it employs a gradient-guided prompt selection strategy to compress less informative prompts into a single aggregated representation, ensuring scalable and memory-efficient continual learning. Extensive experiments on long-sequence and negative transfer benchmarks show that GRID improves average accuracy and backward transfer, achieves competitive forward transfer, and substantially reduces prompt memory usage.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes