MLNov 21, 2016

MDL-motivated compression of GLM ensembles increases interpretability and retains predictive power

Boris Hayete, Matthew Valko, Alex Greenfield, Raymond Yan

arXiv:1611.06800v12.54 citations

Originality Incremental advance

AI Analysis

This addresses the trade-off between predictive power and interpretability for users of ensemble methods in machine learning, offering an incremental improvement.

The paper tackles the loss of interpretability in ensembles of generalized linear models (GLMs) by applying minimum description length (MDL)-motivated compression, which recovers interpretability while retaining predictive performance on standard classification datasets.

Over the years, ensemble methods have become a staple of machine learning. Similarly, generalized linear models (GLMs) have become very popular for a wide variety of statistical inference tasks. The former have been shown to enhance out- of-sample predictive power and the latter possess easy interpretability. Recently, ensembles of GLMs have been proposed as a possibility. On the downside, this approach loses the interpretability that GLMs possess. We show that minimum description length (MDL)-motivated compression of the inferred ensembles can be used to recover interpretability without much, if any, downside to performance and illustrate on a number of standard classification data sets.

View on arXiv PDF

Similar