LG ML | Oct 14, 2024

Replay-and-Forget-Free Graph Class-Incremental Learning: A Task Profiling and Prompting Approach

Chaoxi Niu, Guansong Pang, Ling Chen, Bing Liu

arXiv:2410.10341v216.421 citations | h-index: 8 | Has Code | NIPS

Originality: Highly original

AI Analysis

This paper addresses continual learning on graph-structured data for applications such as node classification, offering a novel solution that is both replay-free and forget-free.

The paper tackles class-incremental learning on graph data with two components: a task profiling method that achieves 100% task ID prediction accuracy on four GCIL benchmarks, and a prompting approach that eliminates catastrophic forgetting. Together they yield at least 18% higher average CIL accuracy than state-of-the-art methods without requiring data replay.

Class-incremental learning (CIL) aims to continually learn a sequence of tasks, with each task consisting of a set of unique classes. Graph CIL (GCIL) follows the same setting but deals with graph tasks (e.g., node classification in a graph). The key characteristic of CIL is the absence of task identifiers (IDs) during inference, which makes it difficult to separate classes from different tasks (i.e., inter-task class separation). Accurately predicting the task IDs can help address this issue, but doing so is itself a challenging problem. In this paper, we show theoretically that accurate task ID prediction on graph data can be achieved by a Laplacian smoothing-based graph task profiling approach, in which each graph task is modeled by a task prototype based on Laplacian smoothing over the graph. This approach guarantees that the task prototypes of the same graph task are nearly the same with a large smoothing step, while those of different tasks are distinct due to differences in graph structure and node attributes. Further, to avoid catastrophic forgetting of the knowledge learned in previous graph tasks, we propose a novel graph prompting approach for GCIL that learns a small discriminative graph prompt for each task, essentially resulting in a separate classification model for each task. The prompt learning requires training a single graph neural network (GNN) only once, on the first task, and no data replay is required thereafter, yielding a GCIL model that is both replay-free and forget-free. Extensive experiments on four GCIL benchmarks show that i) our task prototype-based method achieves 100% task ID prediction accuracy on all four datasets, ii) our GCIL model significantly outperforms state-of-the-art competing methods by at least 18% in average CIL accuracy, and iii) our model is fully free of forgetting on the four datasets.
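The task profiling idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the symmetric normalization with self-loops, the mean over nodes as the prototype, the Euclidean nearest-prototype rule, and the names `smooth_features`, `task_prototype`, `predict_task_id`, and `ring_adjacency` are all assumptions chosen for illustration, and the toy ring graphs stand in for real GCIL benchmark data.

```python
import numpy as np


def smooth_features(adj, feats, steps=3):
    """Apply `steps` rounds of Laplacian smoothing (assumed form):
    X <- D^{-1/2} (A + I) D^{-1/2} X."""
    a_tilde = adj + np.eye(adj.shape[0])        # add self-loops
    deg = a_tilde.sum(axis=1)                   # degrees including self-loops
    d_inv_sqrt = 1.0 / np.sqrt(deg)
    a_hat = d_inv_sqrt[:, None] * a_tilde * d_inv_sqrt[None, :]
    out = feats
    for _ in range(steps):
        out = a_hat @ out
    return out


def task_prototype(adj, feats, steps=3):
    """Hypothetical task prototype: mean of the smoothed node features."""
    return smooth_features(adj, feats, steps).mean(axis=0)


def predict_task_id(test_proto, train_protos):
    """Return the index of the stored prototype closest to `test_proto`."""
    dists = np.linalg.norm(np.stack(train_protos) - test_proto, axis=1)
    return int(np.argmin(dists))


def ring_adjacency(n):
    """Toy undirected ring graph on n nodes (illustration only)."""
    adj = np.zeros((n, n))
    idx = np.arange(n)
    adj[idx, (idx + 1) % n] = 1.0
    adj[(idx + 1) % n, idx] = 1.0
    return adj


# Two toy "tasks" whose node attributes are centred at +1 and -1.
rng = np.random.default_rng(0)
train_feats = [
    1.0 + 0.1 * rng.standard_normal((20, 4)),
    -1.0 + 0.1 * rng.standard_normal((20, 4)),
]
prototypes = [task_prototype(ring_adjacency(20), x) for x in train_feats]

# A test graph drawn from the first task's attribute distribution.
test_feats = 1.0 + 0.1 * rng.standard_normal((12, 4))
test_proto = task_prototype(ring_adjacency(12), test_feats)
predicted = predict_task_id(test_proto, prototypes)
print("predicted task id:", predicted)
```

In a full GCIL pipeline, the predicted task ID would then select the task-specific prompt and classifier; that routing step is omitted here because the abstract does not specify its form.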
