SILGSOC-PHJun 29, 2020

Approximating Network Centrality Measures Using Node Embedding and Machine Learning

arXiv:2006.16392v426 citations
AI Analysis

This work addresses a computational bottleneck for researchers and practitioners analyzing large-scale networks, though it is incremental as it builds on existing graph embedding and machine learning techniques.

The paper tackles the problem of efficiently approximating node centrality measures in large networks, which is computationally expensive, by proposing NCA-GE, a method using neural networks and graph embedding that achieves O(|E|) time complexity and outperforms state-of-the-art methods in various scenarios.

Extracting information from real-world large networks is a key challenge nowadays. For instance, computing a node centrality may become unfeasible depending on the intended centrality due to its computational cost. One solution is to develop fast methods capable of approximating network centralities. Here, we propose an approach for efficiently approximating node centralities for large networks using Neural Networks and Graph Embedding techniques. Our proposed model, entitled Network Centrality Approximation using Graph Embedding (NCA-GE), uses the adjacency matrix of a graph and a set of features for each node (here, we use only the degree) as input and computes the approximate desired centrality rank for every node. NCA-GE has a time complexity of $O(|E|)$, $E$ being the set of edges of a graph, making it suitable for large networks. NCA-GE also trains pretty fast, requiring only a set of a thousand small synthetic scale-free graphs (ranging from 100 to 1000 nodes each), and it works well for different node centralities, network sizes, and topologies. Finally, we compare our approach to the state-of-the-art method that approximates centrality ranks using the degree and eigenvector centralities as input, where we show that the NCA-GE outperforms the former in a variety of scenarios.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes