LGJul 5, 2022
Probability density estimation for sets of large graphs with respect to spectral information using stochastic block modelsDaniel Ferguson, François G. Meyer
For graph-valued data sampled iid from a distribution $μ$, the sample moments are computed with respect to a choice of metric. In this work, we equip the set of graphs with the pseudo-metric defined by the $\ell_2$ norm between the eigenvalues of the respective adjacency matrices. We use this pseudo metric and the respective sample moments of a graph valued data set to infer the parameters of a distribution $\hatμ$ and interpret this distribution as an approximation of $μ$. We verify experimentally that complex distributions $μ$ can be approximated well taking this approach.
MLJan 15, 2022
Theoretical analysis and computation of the sample Frechet mean for sets of large graphs based on spectral informationDaniel Ferguson, Francois G. Meyer
To characterize the location (mean, median) of a set of graphs, one needs a notion of centrality that is adapted to metric spaces, since graph sets are not Euclidean spaces. A standard approach is to consider the Frechet mean. In this work, we equip a set of graphs with the pseudometric defined by the norm between the eigenvalues of their respective adjacency matrix. Unlike the edit distance, this pseudometric reveals structural changes at multiple scales, and is well adapted to studying various statistical problems for graph-valued data. We describe an algorithm to compute an approximation to the sample Frechet mean of a set of undirected unweighted graphs with a fixed size using this pseudometric.
COMay 30, 2021
On the Number of Edges of the Frechet Mean and Median GraphsDaniel Ferguson, Francois G. Meyer
The availability of large datasets composed of graphs creates an unprecedented need to invent novel tools in statistical learning for graph-valued random variables. To characterize the average of a sample of graphs, one can compute the sample Frechet mean and median graphs. In this paper, we address the following foundational question: does a mean or median graph inherit the structural properties of the graphs in the sample? An important graph property is the edge density; we establish that edge density is an hereditary property, which can be transmitted from a graph sample to its sample Frechet mean or median graphs, irrespective of the method used to estimate the mean or the median. Because of the prominence of the Frechet mean in graph-valued machine learning, this novel theoretical result has some significant practical consequences.
SIMay 10, 2021
Approximate Fréchet Mean for Data Sets of Sparse GraphsDaniel Ferguson, François G. Meyer
To characterize the location (mean, median) of a set of graphs, one needs a notion of centrality that is adapted to metric spaces, since graph sets are not Euclidean spaces. A standard approach is to consider the Fréchet mean. In this work, we equip a set of graph with the pseudometric defined by the $\ell_2$ norm between the eigenvalues of their respective adjacency matrix . Unlike the edit distance, this pseudometric reveals structural changes at multiple scales, and is well adapted to studying various statistical problems on sets of graphs. We describe an algorithm to compute an approximation to the Fréchet mean of a set of undirected unweighted graphs with a fixed size.