LGSIFeb 17, 2021

DeepWalking Backwards: From Embeddings Back to Graphs

arXiv:2102.08532v119 citations
Originality Incremental advance
AI Analysis

This provides a more rigorous understanding of graph embeddings for researchers in network analysis and machine learning, but it is incremental as it builds on existing methods.

The paper tackles the problem of understanding what information graph embeddings encode by studying whether embeddings can be inverted to recover the original graph, finding that specific edges and properties like triangle density are often lost, but community structure is preserved or enhanced.

Low-dimensional node embeddings play a key role in analyzing graph datasets. However, little work studies exactly what information is encoded by popular embedding methods, and how this information correlates with performance in downstream machine learning tasks. We tackle this question by studying whether embeddings can be inverted to (approximately) recover the graph used to generate them. Focusing on a variant of the popular DeepWalk method (Perozzi et al., 2014; Qiu et al., 2018), we present algorithms for accurate embedding inversion - i.e., from the low-dimensional embedding of a graph G, we can find a graph H with a very similar embedding. We perform numerous experiments on real-world networks, observing that significant information about G, such as specific edges and bulk properties like triangle density, is often lost in H. However, community structure is often preserved or even enhanced. Our findings are a step towards a more rigorous understanding of exactly what information embeddings encode about the input graph, and why this information is useful for learning tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes