SELGJun 27, 2023

A Meta-analytical Comparison of Naive Bayes and Random Forest for Software Defect Prediction

arXiv:2306.15369v13 citationsh-index: 58
Originality Synthesis-oriented
AI Analysis

This provides clarity for software engineering researchers and practitioners on model selection, though it is incremental as it synthesizes existing evidence.

The study tackled the problem of comparing Naive Bayes and Random Forest for software defect prediction using meta-analysis, finding no significant statistical difference in recall, f-measure, and precision between the two models.

Is there a statistical difference between Naive Bayes and Random Forest in terms of recall, f-measure, and precision for predicting software defects? By utilizing systematic literature review and meta-analysis, we are answering this question. We conducted a systematic literature review by establishing criteria to search and choose papers, resulting in five studies. After that, using the meta-data and forest-plots of five chosen papers, we conducted a meta-analysis to compare the two models. The results have shown that there is no significant statistical evidence that Naive Bayes perform differently from Random Forest in terms of recall, f-measure, and precision.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes