MLNov 21, 2016

MDL-motivated compression of GLM ensembles increases interpretability and retains predictive power

arXiv:1611.06800v14 citations
Originality Incremental advance
AI Analysis

This addresses the trade-off between predictive power and interpretability for users of ensemble methods in machine learning, offering an incremental improvement.

The paper tackles the loss of interpretability in ensembles of generalized linear models (GLMs) by applying minimum description length (MDL)-motivated compression, which recovers interpretability while retaining predictive performance on standard classification datasets.

Over the years, ensemble methods have become a staple of machine learning. Similarly, generalized linear models (GLMs) have become very popular for a wide variety of statistical inference tasks. The former have been shown to enhance out- of-sample predictive power and the latter possess easy interpretability. Recently, ensembles of GLMs have been proposed as a possibility. On the downside, this approach loses the interpretability that GLMs possess. We show that minimum description length (MDL)-motivated compression of the inferred ensembles can be used to recover interpretability without much, if any, downside to performance and illustrate on a number of standard classification data sets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes