MEAIGNMLMay 5, 2019

Decision Making with Machine Learning and ROC Curves

arXiv:1905.02810v125 citations
Originality Synthesis-oriented
AI Analysis

It addresses model selection challenges in binary classification for healthcare decision-making, but appears incremental as it builds on existing ROC curve theory.

The paper analyzes the statistical properties of ROC curves and their implications for model selection, using a theoretical framework to study incentive heterogeneity and information asymmetry in human decisions, illustrated with a dataset of pregnancy outcomes and doctor diagnoses from China.

The Receiver Operating Characteristic (ROC) curve is a representation of the statistical information discovered in binary classification problems and is a key concept in machine learning and data science. This paper studies the statistical properties of ROC curves and its implication on model selection. We analyze the implications of different models of incentive heterogeneity and information asymmetry on the relation between human decisions and the ROC curves. Our theoretical discussion is illustrated in the context of a large data set of pregnancy outcomes and doctor diagnosis from the Pre-Pregnancy Checkups of reproductive age couples in Henan Province provided by the Chinese Ministry of Health.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes