LGMLMay 30, 2013

Predicting the Severity of Breast Masses with Data Mining Methods

arXiv:1305.7057v122 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of reducing unnecessary biopsies in breast cancer screening for physicians, but it is incremental as it applies existing methods to a specific dataset without major innovations.

The study applied data mining classification algorithms to predict the severity of breast masses as benign or malignant using BI-RADS attributes and patient age, achieving accuracies of 78.12% for Decision Tree, 80.56% for Artificial Neural Network, and 81.25% for Support Vector Machine on test samples.

Mammography is the most effective and available tool for breast cancer screening. However, the low positive predictive value of breast biopsy resulting from mammogram interpretation leads to approximately 70% unnecessary biopsies with benign outcomes. Data mining algorithms could be used to help physicians in their decisions to perform a breast biopsy on a suspicious lesion seen in a mammogram image or to perform a short term follow-up examination instead. In this research paper data mining classification algorithms; Decision Tree (DT), Artificial Neural Network (ANN), and Support Vector Machine (SVM) are analyzed on mammographic masses data set. The purpose of this study is to increase the ability of physicians to determine the severity (benign or malignant) of a mammographic mass lesion from BI-RADS attributes and the patient,s age. The whole data set is divided for training the models and test them by the ratio of 70:30% respectively and the performances of classification algorithms are compared through three statistical measures; sensitivity, specificity, and classification accuracy. Accuracy of DT, ANN and SVM are 78.12%, 80.56% and 81.25% of test samples respectively. Our analysis shows that out of these three classification models SVM predicts severity of breast cancer with least error rate and highest accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes