Explainable predictions of different machine learning algorithms used to predict Early Stage diabetes
This work provides an incremental comparison of standard methods for diabetes diagnosis, aiding healthcare practitioners in selecting algorithms for early detection.
The paper compared multiple machine learning algorithms, including Random Forest, Decision Tree, and XGBoost, to predict early-stage diabetes using a hospital dataset, finding that Random Forest achieved the highest accuracy of 99%.
Machine Learning and Artificial Intelligence can be widely used to diagnose chronic diseases so that necessary precautionary treatment can be done in critical time. Diabetes Mellitus which is one of the major diseases can be easily diagnosed by several Machine Learning algorithms. Early stage diagnosis is crucial to prevent dangerous consequences. In this paper we have made a comparative analysis of several machine learning algorithms viz. Random Forest, Decision Tree, Artificial Neural Networks, K Nearest Neighbor, Support Vector Machine, and XGBoost along with feature attribution using SHAP to identify the most important feature in predicting the diabetes on a dataset collected from Sylhet Hospital. As per the experimental results obtained, the Random Forest algorithm has outperformed all the other algorithms with an accuracy of 99 percent on this particular dataset.