LGAIDSSep 2, 2021

Computing Graph Descriptors on Edge Streams

arXiv:2109.01494v57 citations
Originality Incremental advance
AI Analysis

This work addresses scalability issues in graph analytics for users dealing with large graphs, offering a more memory-efficient solution, though it is incremental as it builds on existing descriptor methods.

The paper tackled the problem of computing graph descriptors for large graphs by developing streaming algorithms that operate on edge streams, avoiding full graph storage and allowing runtime control. The result is scalable algorithms that compute descriptors for graphs with millions of edges within minutes, achieving predictive accuracy comparable to state-of-the-art methods while using only 25% as much memory.

Feature extraction is an essential task in graph analytics. These feature vectors, called graph descriptors, are used in downstream vector-space-based graph analysis models. This idea has proved fruitful in the past, with spectral-based graph descriptors providing state-of-the-art classification accuracy. However, known algorithms to compute meaningful descriptors do not scale to large graphs since: (1) they require storing the entire graph in memory, and (2) the end-user has no control over the algorithm's runtime. In this paper, we present streaming algorithms to approximately compute three different graph descriptors capturing the essential structure of graphs. Operating on edge streams allows us to avoid storing the entire graph in memory, and controlling the sample size enables us to keep the runtime of our algorithms within desired bounds. We demonstrate the efficacy of the proposed descriptors by analyzing the approximation error and classification accuracy. Our scalable algorithms compute descriptors of graphs with millions of edges within minutes. Moreover, these descriptors yield predictive accuracy comparable to the state-of-the-art methods but can be computed using only 25% as much memory.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes