Reconstructing networks with unknown and heterogeneous errors
This addresses the issue of inaccurate network analysis due to unaccounted errors, offering a more flexible method for researchers in network science, though it is incremental as it builds on existing Bayesian approaches.
The paper tackles the problem of network reconstruction from noisy data with unknown and heterogeneous errors, developing a Bayesian approach that works with single edge measurements and yields a principled method to infer hierarchical community structure, demonstrating efficacy on empirical and artificial networks.
The vast majority of network datasets contains errors and omissions, although this is rarely incorporated in traditional network analysis. Recently, an increasing effort has been made to fill this methodological gap by developing network reconstruction approaches based on Bayesian inference. These approaches, however, rely on assumptions of uniform error rates and on direct estimations of the existence of each edge via repeated measurements, something that is currently unavailable for the majority of network data. Here we develop a Bayesian reconstruction approach that lifts these limitations by not only allowing for heterogeneous errors, but also for single edge measurements without direct error estimates. Our approach works by coupling the inference approach with structured generative network models, which enable the correlations between edges to be used as reliable uncertainty estimates. Although our approach is general, we focus on the stochastic block model as the basic generative process, from which efficient nonparametric inference can be performed, and yields a principled method to infer hierarchical community structure from noisy data. We demonstrate the efficacy of our approach with a variety of empirical and artificial networks.