LGIRSIDec 12, 2022

Dirichlet-Survival Process: Scalable Inference of Topic-Dependent Diffusion Networks

arXiv:2212.05996v12 citationsh-index: 13
Originality Incremental advance
AI Analysis

This addresses the challenge of scalable inference for topic-dependent diffusion networks, offering a more comprehensive model for researchers in network analysis and information diffusion, though it builds incrementally on existing Dirichlet-Point processes.

The paper tackled the problem of jointly modeling information spread on networks by considering content, timing, and network position, introducing the Houston model in a non-parametric unsupervised framework. It resulted in significant improvements over baselines in cluster recovery and subnetworks inference tasks.

Information spread on networks can be efficiently modeled by considering three features: documents' content, time of publication relative to other publications, and position of the spreader in the network. Most previous works model up to two of those jointly, or rely on heavily parametric approaches. Building on recent Dirichlet-Point processes literature, we introduce the Houston (Hidden Online User-Topic Network) model, that jointly considers all those features in a non-parametric unsupervised framework. It infers dynamic topic-dependent underlying diffusion networks in a continuous-time setting along with said topics. It is unsupervised; it considers an unlabeled stream of triplets shaped as \textit{(time of publication, information's content, spreading entity)} as input data. Online inference is conducted using a sequential Monte-Carlo algorithm that scales linearly with the size of the dataset. Our approach yields consequent improvements over existing baselines on both cluster recovery and subnetworks inference tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes