SIIROct 22, 2020

Spikyball sampling: Exploring large networks via an inhomogeneous filtered diffusion

arXiv:2010.11786v14 citations
Originality Incremental advance
AI Analysis

This method addresses the problem of efficiently sampling relevant parts of large networks for researchers and analysts, but it is incremental as it generalizes and extends existing approaches like Snowball or Forest Fire sampling.

The authors tackled the challenge of exploring large, complex networks by proposing a new sampling method called spikyball sampling, which uses a filtered breadth-first search with random neighbor selection to capture highly interactive groups while discarding weakly connected nodes.

Studying real-world networks such as social networks or web networks is a challenge. These networks often combine a complex, highly connected structure together with a large size. We propose a new approach for large scale networks that is able to automatically sample user-defined relevant parts of a network. Starting from a few selected places in the network and a reduced set of expansion rules, the method adopts a filtered breadth-first search approach, that expands through edges and nodes matching these properties. Moreover, the expansion is performed over a random subset of neighbors at each step to mitigate further the overwhelming number of connections that may exist in large graphs. This carries the image of a "spiky" expansion. We show that this approach generalize previous exploration sampling methods, such as Snowball or Forest Fire and extend them. We demonstrate its ability to capture groups of nodes with high interactions while discarding weakly connected nodes that are often numerous in social networks and may hide important structures.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes