SE | Sep 6, 2016

Tuning for Software Analytics: is it Really Necessary?

arXiv:1609.01759v1 | 219 citations | Has Code
Originality: Incremental advance
AI Analysis

This paper addresses the need for efficient tuning methods in software engineering, specifically for defect prediction, and argues for a change in standard practice. The advance is incremental: it applies an existing optimization technique to a known bottleneck.

The paper tackles the problem of tuning data miners in software analytics and finds that simple automatic tuning with differential evolution can substantially improve defect prediction, in one example raising precision from 0% to 60%, after only tens of attempts.

Context: Data miners have been widely used in software engineering to, say, generate defect predictors from static code measures. Such static code defect predictors perform well compared to manual methods, and they are easy to use and useful to use. But one of the "black arts" of data mining is setting the tunings that control the miner. Objective: We seek a simple, automatic, and very effective method for finding those tunings. Method: For each experiment with different data sets (from open source Java systems), we ran differential evolution as an optimizer to explore the tuning space (as a first step), then tested the tunings using hold-out data. Results: Contrary to our prior expectations, we found these tunings were remarkably simple: it only required tens, not thousands, of attempts to obtain very good results. For example, when learning software defect predictors, this method can quickly find tunings that alter detection precision from 0% to 60%. Conclusion: Since (1) the improvements are so large, and (2) the tuning is so simple, we need to change standard methods in software analytics. At least for defect prediction, it is no longer enough to just run a data miner and present the result without conducting a tuning optimization study. The implication for other kinds of analytics is now an open and pressing issue.
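To make the Method step concrete, here is a minimal sketch of differential evolution searching a small tuning space. It is an illustration under stated assumptions, not the authors' implementation: the function name, the population size, the values of f and cr, the number of generations, and the toy scoring function are all hypothetical. In a real study the score would be a measure such as precision, computed on tuning data that is kept separate from the hold-out data.

```python
import random


def differential_evolution(score, bounds, pop_size=10, f=0.75, cr=0.3,
                           generations=5, seed=1):
    """Maximize score(candidate) over box-bounded numeric tunings.

    All parameter defaults are illustrative assumptions, not values
    taken from the paper. Requires pop_size >= 4.
    """
    rng = random.Random(seed)
    dims = len(bounds)
    # Start from a small random population of candidate tunings.
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    scores = [score(candidate) for candidate in pop]
    evaluations = pop_size
    for _ in range(generations):
        for i in range(pop_size):
            # Pick three distinct members other than the current one.
            others = [p for j, p in enumerate(pop) if j != i]
            a, b, c = rng.sample(others, 3)
            forced = rng.randrange(dims)  # one dimension always changes
            trial = []
            for d, (lo, hi) in enumerate(bounds):
                if d == forced or rng.random() < cr:
                    value = a[d] + f * (b[d] - c[d])
                    value = min(hi, max(lo, value))  # clip to the bounds
                else:
                    value = pop[i][d]
                trial.append(value)
            trial_score = score(trial)
            evaluations += 1
            # Greedy selection: keep the trial only if it is no worse.
            if trial_score >= scores[i]:
                pop[i], scores[i] = trial, trial_score
    best_index = max(range(pop_size), key=lambda k: scores[k])
    return pop[best_index], scores[best_index], evaluations


def toy_score(tuning):
    """Hypothetical stand-in for a learner's score; best value 0 at (3, -1)."""
    x, y = tuning
    return -((x - 3) ** 2 + (y + 1) ** 2)


best, best_score, evaluations = differential_evolution(
    toy_score, [(-10, 10), (-10, 10)])
print(best, best_score, evaluations)
```

With these assumed settings the search spends 10 evaluations on the initial population and 10 more per generation, so the whole run uses 60 evaluations, which is in the "tens, not thousands" range the abstract describes.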

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.
