STLGMEMLDec 28, 2023

Inconsistency of cross-validation for structure learning in Gaussian graphical models

arXiv:2312.17047v1h-index: 23AISTATS
Originality Incremental advance
AI Analysis

This addresses a practical challenge in hyperparameter selection for graphical model algorithms, but it is incremental as it highlights known limitations with new theoretical bounds.

The paper demonstrates that cross-validation is inconsistent for structure learning in Gaussian graphical models, providing finite-sample bounds on neighborhood misidentification probability and showing empirical inconsistency compared to other criteria.

Despite numerous years of research into the merits and trade-offs of various model selection criteria, obtaining robust results that elucidate the behavior of cross-validation remains a challenging endeavor. In this paper, we highlight the inherent limitations of cross-validation when employed to discern the structure of a Gaussian graphical model. We provide finite-sample bounds on the probability that the Lasso estimator for the neighborhood of a node within a Gaussian graphical model, optimized using a prediction oracle, misidentifies the neighborhood. Our results pertain to both undirected and directed acyclic graphs, encompassing general, sparse covariance structures. To support our theoretical findings, we conduct an empirical investigation of this inconsistency by contrasting our outcomes with other commonly used information criteria through an extensive simulation study. Given that many algorithms designed to learn the structure of graphical models require hyperparameter selection, the precise calibration of this hyperparameter is paramount for accurately estimating the inherent structure. Consequently, our observations shed light on this widely recognized practical challenge.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes