SILGApr 12, 2022

A Hierarchical Block Distance Model for Ultra Low-Dimensional Graph Representations

arXiv:2204.05885v213 citationsh-index: 34
Originality Incremental advance
AI Analysis

This addresses the need for efficient and interpretable graph analysis for large-scale network applications, though it appears incremental as it builds on existing models like stochastic block modeling and latent distance models.

The paper tackles the problem of scalable graph representation learning by proposing the Hierarchical Block Distance Model (HBDM), which significantly outperforms recent scalable approaches on massive networks with millions of nodes, achieving superior performance even with ultra-low two-dimensional embeddings.

Graph Representation Learning (GRL) has become central for characterizing structures of complex networks and performing tasks such as link prediction, node classification, network reconstruction, and community detection. Whereas numerous generative GRL models have been proposed, many approaches have prohibitive computational requirements hampering large-scale network analysis, fewer are able to explicitly account for structure emerging at multiple scales, and only a few explicitly respect important network properties such as homophily and transitivity. This paper proposes a novel scalable graph representation learning method named the Hierarchical Block Distance Model (HBDM). The HBDM imposes a multiscale block structure akin to stochastic block modeling (SBM) and accounts for homophily and transitivity by accurately approximating the latent distance model (LDM) throughout the inferred hierarchy. The HBDM naturally accommodates unipartite, directed, and bipartite networks whereas the hierarchy is designed to ensure linearithmic time and space complexity enabling the analysis of very large-scale networks. We evaluate the performance of the HBDM on massive networks consisting of millions of nodes. Importantly, we find that the proposed HBDM framework significantly outperforms recent scalable approaches in all considered downstream tasks. Surprisingly, we observe superior performance even imposing ultra-low two-dimensional embeddings facilitating accurate direct and hierarchical-aware network visualization and interpretation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes