ITLGSIMLFeb 11, 2016

On the Difficulty of Selecting Ising Models with Approximate Recovery

arXiv:1602.03647v212 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of graph selection in statistical learning for researchers, revealing that approximate recovery is often as hard as exact recovery, which is incremental but clarifies theoretical limits.

The paper tackles the problem of estimating graphs in Ising models under an approximate recovery criterion, showing that for many graph classes, the sample complexity lower bounds match those for exact recovery, indicating similar difficulty.

In this paper, we consider the problem of estimating the underlying graph associated with an Ising model given a number of independent and identically distributed samples. We adopt an \emph{approximate recovery} criterion that allows for a number of missed edges or incorrectly-included edges, in contrast with the widely-studied exact recovery problem. Our main results provide information-theoretic lower bounds on the sample complexity for graph classes imposing constraints on the number of edges, maximal degree, and other properties. We identify a broad range of scenarios where, either up to constant factors or logarithmic factors, our lower bounds match the best known lower bounds for the exact recovery criterion, several of which are known to be tight or near-tight. Hence, in these cases, approximate recovery has a similar difficulty to exact recovery in the minimax sense. Our bounds are obtained via a modification of Fano's inequality for handling the approximate recovery criterion, along with suitably-designed ensembles of graphs that can broadly be classed into two categories: (i) Those containing graphs that contain several isolated edges or cliques and are thus difficult to distinguish from the empty graph; (ii) Those containing graphs for which certain groups of nodes are highly correlated, thus making it difficult to determine precisely which edges connect them. We support our theoretical results on these ensembles with numerical experiments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes