LGNov 8, 2021

Cold Brew: Distilling Graph Node Representations with Incomplete or Missing Neighborhoods

arXiv:2111.04840v386 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses a critical bottleneck for GNNs in real-world applications like e-commerce where many nodes have incomplete or missing connections, though it is an incremental improvement over existing distillation methods.

The paper tackles the problem of Strict Cold Start (SCS) in Graph Neural Networks (GNNs), where nodes have no neighbors, by proposing Cold Brew, a teacher-student distillation approach, and shows superior performance on benchmark and e-commerce datasets.

Graph Neural Networks (GNNs) have achieved state-of-the-art performance in node classification, regression, and recommendation tasks. GNNs work well when rich and high-quality connections are available. However, their effectiveness is often jeopardized in many real-world graphs in which node degrees have power-law distributions. The extreme case of this situation, where a node may have no neighbors, is called Strict Cold Start (SCS). SCS forces the prediction to rely completely on the node's own features. We propose Cold Brew, a teacher-student distillation approach to address the SCS and noisy-neighbor challenges for GNNs. We also introduce feature contribution ratio (FCR), a metric to quantify the behavior of inductive GNNs to solve SCS. We experimentally show that FCR disentangles the contributions of different graph data components and helps select the best architecture for SCS generalization. We further demonstrate the superior performance of Cold Brew on several public benchmark and proprietary e-commerce datasets, where many nodes have either very few or noisy connections. Our source code is available at https://github.com/amazon-research/gnn-tail-generalization.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes