LGJul 13, 2023

Frameless Graph Knowledge Distillation

arXiv:2307.06631v14 citationsh-index: 16Has Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of efficiently transferring graph knowledge in GNNs for applications requiring fast inference, though it appears incremental by building on existing KD and graph framelet methods.

The paper tackles the problem of knowledge distillation for graph neural networks by proposing a framework that utilizes multi-scaled graph knowledge from graph framelet decomposition, resulting in student models achieving identical or superior accuracy to teacher models while maintaining high inference speed.

Knowledge distillation (KD) has shown great potential for transferring knowledge from a complex teacher model to a simple student model in which the heavy learning task can be accomplished efficiently and without losing too much prediction accuracy. Recently, many attempts have been made by applying the KD mechanism to the graph representation learning models such as graph neural networks (GNNs) to accelerate the model's inference speed via student models. However, many existing KD-based GNNs utilize MLP as a universal approximator in the student model to imitate the teacher model's process without considering the graph knowledge from the teacher model. In this work, we provide a KD-based framework on multi-scaled GNNs, known as graph framelet, and prove that by adequately utilizing the graph knowledge in a multi-scaled manner provided by graph framelet decomposition, the student model is capable of adapting both homophilic and heterophilic graphs and has the potential of alleviating the over-squashing issue with a simple yet effectively graph surgery. Furthermore, we show how the graph knowledge supplied by the teacher is learned and digested by the student model via both algebra and geometry. Comprehensive experiments show that our proposed model can generate learning accuracy identical to or even surpass the teacher model while maintaining the high speed of inference.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes