LGFeb 9, 2025

Learning Accurate, Efficient, and Interpretable MLPs on Multiplex Graphs via Node-wise Multi-View Ensemble Distillation

arXiv:2502.05864v13 citationsh-index: 4DASFAA
Originality Incremental advance
AI Analysis

This work addresses the problem of high inference latency for multiplex graph models, making them more practical for real-time applications, though it is incremental as it builds on existing GNN-to-MLP distillation frameworks.

The paper tackles the challenge of deploying multiplex graph neural networks (MGNNs) in latency-sensitive applications by proposing Multiplex Graph-Free Neural Networks (MGFNN and MGFNN+), which use knowledge distillation to train student MLPs for efficient inference. The results show that MGFNNs achieve about 10% higher accuracy than vanilla MLPs, perform comparably to teacher MGNNs, and achieve a 35.40x-89.14x speedup in inference over MGNNs.

Multiplex graphs, with multiple edge types (graph views) among common nodes, provide richer structural semantics and better modeling capabilities. Multiplex Graph Neural Networks (MGNNs), typically comprising view-specific GNNs and a multi-view integration layer, have achieved advanced performance in various downstream tasks. However, their reliance on neighborhood aggregation poses challenges for deployment in latency-sensitive applications. Motivated by recent GNN-to-MLP knowledge distillation frameworks, we propose Multiplex Graph-Free Neural Networks (MGFNN and MGFNN+) to combine MGNNs' superior performance and MLPs' efficient inference via knowledge distillation. MGFNN directly trains student MLPs with node features as input and soft labels from teacher MGNNs as targets. MGFNN+ further employs a low-rank approximation-based reparameterization to learn node-wise coefficients, enabling adaptive knowledge ensemble from each view-specific GNN. This node-wise multi-view ensemble distillation strategy allows student MLPs to learn more informative multiplex semantic knowledge for different nodes. Experiments show that MGFNNs achieve average accuracy improvements of about 10% over vanilla MLPs and perform comparably or even better to teacher MGNNs (accurate); MGFNNs achieve a 35.40$\times$-89.14$\times$ speedup in inference over MGNNs (efficient); MGFNN+ adaptively assigns different coefficients for multi-view ensemble distillation regarding different nodes (interpretable).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes