LG AIOct 22, 2024

Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification

Yihong Luo, Yuhan Chen, Siya Qiu, Yiwei Wang, Chen Zhang, Yan Zhou, Xiaochun Cao, Jing Tang

arXiv:2410.16845v110.47 citationsh-index: 6Has CodeNIPS

Originality Incremental advance

AI Analysis

This work addresses the problem of robust generalization in few-shot node classification for graph learning researchers, presenting an incremental improvement by enhancing SAM efficiency for GNNs.

The paper tackles the challenge of poor generalization in Few-Shot Node Classification (FSNC) with Graph Neural Networks (GNNs) by proposing Fast Graph Sharpness-Aware Minimization (FGSAM), which integrates Sharpness-Aware Minimization (SAM) into GNN training to find flat minima for better generalization. The result shows that FGSAM outperforms standard SAM with lower computational costs and offers faster optimization than base optimizers in most cases, while also achieving competitive performance in standard node classification for heterophilic graphs.

Graph Neural Networks (GNNs) have shown superior performance in node classification. However, GNNs perform poorly in the Few-Shot Node Classification (FSNC) task that requires robust generalization to make accurate predictions for unseen classes with limited labels. To tackle the challenge, we propose the integration of Sharpness-Aware Minimization (SAM)--a technique designed to enhance model generalization by finding a flat minimum of the loss landscape--into GNN training. The standard SAM approach, however, consists of two forward-backward steps in each training iteration, doubling the computational cost compared to the base optimizer (e.g., Adam). To mitigate this drawback, we introduce a novel algorithm, Fast Graph Sharpness-Aware Minimization (FGSAM), that integrates the rapid training of Multi-Layer Perceptrons (MLPs) with the superior performance of GNNs. Specifically, we utilize GNNs for parameter perturbation while employing MLPs to minimize the perturbed loss so that we can find a flat minimum with good generalization more efficiently. Moreover, our method reutilizes the gradient from the perturbation phase to incorporate graph topology into the minimization process at almost zero additional cost. To further enhance training efficiency, we develop FGSAM+ that executes exact perturbations periodically. Extensive experiments demonstrate that our proposed algorithm outperforms the standard SAM with lower computational costs in FSNC tasks. In particular, our FGSAM+ as a SAM variant offers a faster optimization than the base optimizer in most cases. In addition to FSNC, our proposed methods also demonstrate competitive performance in the standard node classification task for heterophilic graphs, highlighting the broad applicability. The code is available at https://github.com/draym28/FGSAM_NeurIPS24.

View on arXiv PDF Code

Similar