SEOct 18, 2021

Use and Misuse of the Term Experiment in Mining Software Repositories Research

arXiv:2110.09366v120 citations
Originality Synthesis-oriented
AI Analysis

This addresses methodological clarity for MSR researchers, highlighting a widespread issue that compromises result interpretation, but it is incremental as it critiques existing practices without introducing new methods.

The paper analyzed the use of the term 'experiment' in Mining Software Repositories (MSR) research, finding that 19% of papers misuse it for observational studies, and only 1% of the remaining refer to genuine controlled experiments, with most having limited control.

The significant momentum and importance of Mining Software Repositories (MSR) in Software Engineering (SE) has fostered new opportunities and challenges for extensive empirical research. However, MSR researchers seem to struggle to characterize the empirical methods they use into the existing empirical SE body of knowledge. This is especially the case of MSR experiments. To provide evidence on the special characteristics of MSR experiments and their differences with experiments traditionally acknowledged in SE so far, we elicited the hallmarks that differentiate an experiment from other types of empirical studies and characterized the hallmarks and types of experiments in MSR. We analyzed MSR literature obtained from a small-scale systematic mapping study to assess the use of the term experiment in MSR. We found that 19% of the papers claiming to be an experiment are indeed not an experiment at all but also observational studies, so they use the term in a misleading way. From the remaining 81% of the papers, only one of them refers to a genuine controlled experiment while the others stand for experiments with limited control. MSR researchers tend to overlook such limitations, compromising the interpretation of the results of their studies. We provide recommendations and insights to support the improvement of MSR experiments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes