SILGMar 13, 2018

VERSE: Versatile Graph Embeddings from Similarity Measures

arXiv:1803.04742v1283 citations
Originality Incremental advance
AI Analysis

This addresses the need for efficient and effective graph embeddings in data mining, though it is incremental as it builds on previous similarity-based approaches.

The paper tackles the problem of embedding large graphs into low-dimensional vectors for tasks like link prediction and classification by proposing VERSE, a method that explicitly preserves vertex similarity distributions, and it outperforms state-of-the-art methods in precision and recall while being more time- and space-efficient.

Embedding a web-scale information network into a low-dimensional vector space facilitates tasks such as link prediction, classification, and visualization. Past research has addressed the problem of extracting such embeddings by adopting methods from words to graphs, without defining a clearly comprehensible graph-related objective. Yet, as we show, the objectives used in past works implicitly utilize similarity measures among graph nodes. In this paper, we carry the similarity orientation of previous works to its logical conclusion; we propose VERtex Similarity Embeddings (VERSE), a simple, versatile, and memory-efficient method that derives graph embeddings explicitly calibrated to preserve the distributions of a selected vertex-to-vertex similarity measure. VERSE learns such embeddings by training a single-layer neural network. While its default, scalable version does so via sampling similarity information, we also develop a variant using the full information per vertex. Our experimental study on standard benchmarks and real-world datasets demonstrates that VERSE, instantiated with diverse similarity measures, outperforms state-of-the-art methods in terms of precision and recall in major data mining tasks and supersedes them in time and space efficiency, while the scalable sampling-based variant achieves equally good results as the non-scalable full variant.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes