CYLGQMMLDec 11, 2018

Classification of Cervical Cancer Dataset

arXiv:1812.10383v138 citations
Originality Synthesis-oriented
AI Analysis

This work provides an incremental improvement in classification accuracy for cervical cancer diagnosis, which could aid medical professionals in early detection.

The paper tackled cervical cancer classification using a dataset with 858 samples and 32 attributes, addressing missing values and imbalance through sampling techniques and feature selection, achieving 97.5% accuracy with key predictive features like age and number of pregnancies.

Cervical cancer is the leading gynecological malignancy worldwide. This paper presents diverse classification techniques and shows the advantage of feature selection approaches to the best predicting of cervical cancer disease. There are thirty-two attributes with eight hundred and fifty-eight samples. Besides, this data suffers from missing values and imbalance data. Therefore, over-sampling, under-sampling and embedded over and under sampling have been used. Furthermore, dimensionality reduction techniques are required for improving the accuracy of the classifier. Therefore, feature selection methods have been studied as they divided into two distinct categories, filters and wrappers. The results show that age, first sexual intercourse, number of pregnancies, smokes, hormonal contraceptives, and STDs: genital herpes are the main predictive features with high accuracy with 97.5%. Decision Tree classifier is shown to be advantageous in handling classification assignment with excellent performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes