LGApr 29, 2023

Taming graph kernels with random features

arXiv:2305.00156v121 citationsh-index: 34
Originality Incremental advance
AI Analysis

This addresses a critical bottleneck in graph analysis for researchers and practitioners by making kernel methods computationally feasible for large-scale graph data, though it is an incremental improvement building on existing random feature techniques.

The paper tackles the high computational complexity of graph kernel methods by introducing graph random features (GRFs), which provide unbiased estimators for kernels like the regularized Laplacian kernel, reducing time complexity from cubic to linear and enabling scalability to larger networks with substantial speed gains.

We introduce in this paper the mechanism of graph random features (GRFs). GRFs can be used to construct unbiased randomized estimators of several important kernels defined on graphs' nodes, in particular the regularized Laplacian kernel. As regular RFs for non-graph kernels, they provide means to scale up kernel methods defined on graphs to larger networks. Importantly, they give substantial computational gains also for smaller graphs, while applied in downstream applications. Consequently, GRFs address the notoriously difficult problem of cubic (in the number of the nodes of the graph) time complexity of graph kernels algorithms. We provide a detailed theoretical analysis of GRFs and an extensive empirical evaluation: from speed tests, through Frobenius relative error analysis to kmeans graph-clustering with graph kernels. We show that the computation of GRFs admits an embarrassingly simple distributed algorithm that can be applied if the graph under consideration needs to be split across several machines. We also introduce a (still unbiased) quasi Monte Carlo variant of GRFs, q-GRFs, relying on the so-called reinforced random walks, that might be used to optimize the variance of GRFs. As a byproduct, we obtain a novel approach to solve certain classes of linear equations with positive and symmetric matrices.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes