LGMLOct 8, 2018

Efficient Non-parametric Bayesian Hawkes Processes

arXiv:1810.03730v543 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the need for flexible and scalable event modeling in domains like social media analysis, though it is incremental as it builds on existing cluster representation techniques.

The paper tackles the problem of estimating kernel functions in Hawkes processes by developing efficient non-parametric Bayesian methods, resulting in linear time complexity and outperforming state-of-the-art in goodness-of-fit on Twitter diffusion datasets.

In this paper, we develop an efficient nonparametric Bayesian estimation of the kernel function of Hawkes processes. The non-parametric Bayesian approach is important because it provides flexible Hawkes kernels and quantifies their uncertainty. Our method is based on the cluster representation of Hawkes processes. Utilizing the finite support assumption of the Hawkes process, we efficiently sample random branching structures and thus, we split the Hawkes process into clusters of Poisson processes. We derive two algorithms -- a block Gibbs sampler and a maximum a posteriori estimator based on expectation maximization -- and we show that our methods have a linear time complexity, both theoretically and empirically. On synthetic data, we show our methods to be able to infer flexible Hawkes triggering kernels. On two large-scale Twitter diffusion datasets, we show that our methods outperform the current state-of-the-art in goodness-of-fit and that the time complexity is linear in the size of the dataset. We also observe that on diffusions related to online videos, the learned kernels reflect the perceived longevity for different content types such as music or pets videos.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes