CYLGMLMar 14, 2019

Interpretation of machine learning predictions for patient outcomes in electronic health records

arXiv:1903.12074v159 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of validating predictive models in healthcare for clinicians and researchers, but it is incremental as it compares existing interpretation methods without introducing new ones.

The study tackled the problem of interpreting machine learning predictions for patient outcomes in electronic health records by comparing feature importance methods, finding poor correlation between methods but noting that permutation tests on random forest and gradient boosting models showed the most agreement with clinical interpretation.

Electronic health records are an increasingly important resource for understanding the interactions between patient health, environment, and clinical decisions. In this paper we report an empirical study of predictive modeling of several patient outcomes using three state-of-the-art machine learning methods. Our primary goal is to validate the models by interpreting the importance of predictors in the final models. Central to interpretation is the use of feature importance scores, which vary depending on the underlying methodology. In order to assess feature importance, we compared univariate statistical tests, information-theoretic measures, permutation testing, and normalized coefficients from multivariate logistic regression models. In general we found poor correlation between methods in their assessment of feature importance, even when their performance is comparable and relatively good. However, permutation tests applied to random forest and gradient boosting models showed the most agreement, and the importance scores matched the clinical interpretation most frequently.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes