LG AI SEJan 30, 2025

Towards Transparent and Accurate Diabetes Prediction Using Machine Learning and Explainable Artificial Intelligence

Pir Bakhsh Khokhar, Viviana Pentangelo, Fabio Palomba, Carmine Gravino

arXiv:2501.18071v24.117 citationsh-index: 9

Originality Synthesis-oriented

AI Analysis

It addresses early diagnosis and management of diabetes for healthcare systems, but is incremental as it applies existing methods to a specific dataset.

This study tackled diabetes prediction by developing a machine learning framework with explainable AI tools, achieving a test accuracy of 92.50% and ROC-AUC of 0.975.

Diabetes mellitus (DM) is a global health issue of significance that must be diagnosed as early as possible and managed well. This study presents a framework for diabetes prediction using Machine Learning (ML) models, complemented with eXplainable Artificial Intelligence (XAI) tools, to investigate both the predictive accuracy and interpretability of the predictions from ML models. Data Preprocessing is based on the Synthetic Minority Oversampling Technique (SMOTE) and feature scaling used on the Diabetes Binary Health Indicators dataset to deal with class imbalance and variability of clinical features. The ensemble model provided high accuracy, with a test accuracy of 92.50% and an ROC-AUC of 0.975. BMI, Age, General Health, Income, and Physical Activity were the most influential predictors obtained from the model explanations. The results of this study suggest that ML combined with XAI is a promising means of developing accurate and computationally transparent tools for use in healthcare systems.

View on arXiv PDF

Similar