MEMLJul 6, 2021

T-LoHo: A Bayesian Regularization Model for Structured Sparsity and Smoothness on Graphs

arXiv:2107.02510v21 citations
AI Analysis

This addresses the need for better regularization in Bayesian models for graph-structured data, offering a novel prior that handles clustering and contiguity constraints, though it is incremental as it builds on existing horseshoe shrinkage methods.

The paper tackles the problem of detecting structured sparsity and smoothness in high-dimensional parameters on graphs, proposing the T-LoHo model, which shows substantial improvements over methods like sparse fused lasso in simulation and real-world anomaly detection on a road network.

Graphs have been commonly used to represent complex data structures. In models dealing with graph-structured data, multivariate parameters may not only exhibit sparse patterns but have structured sparsity and smoothness in the sense that both zero and non-zero parameters tend to cluster together. We propose a new prior for high-dimensional parameters with graphical relations, referred to as the Tree-based Low-rank Horseshoe (T-LoHo) model, that generalizes the popular univariate Bayesian horseshoe shrinkage prior to the multivariate setting to detect structured sparsity and smoothness simultaneously. The T-LoHo prior can be embedded in many high-dimensional hierarchical models. To illustrate its utility, we apply it to regularize a Bayesian high-dimensional regression problem where the regression coefficients are linked by a graph, so that the resulting clusters have flexible shapes and satisfy the cluster contiguity constraint with respect to the graph. We design an efficient Markov chain Monte Carlo algorithm that delivers full Bayesian inference with uncertainty measures for model parameters such as the number of clusters. We offer theoretical investigations of the clustering effects and posterior concentration results. Finally, we illustrate the performance of the model with simulation studies and a real data application for anomaly detection on a road network. The results indicate substantial improvements over other competing methods such as the sparse fused lasso.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes