SEMar 17, 2017

Bellwethers: A Baseline Method For Transfer Learning

arXiv:1703.06218v426.282 citations

Originality Incremental advance

AI Analysis

This provides a simple baseline method for transfer learning in software engineering tasks like defect prediction, though it is incremental as it builds on existing data mining techniques.

The paper tackles the problem of conclusion instability in software analytics by proposing 'bellwethers', a project whose data yields the best predictions across others, which reduces instability and yields comparable predictions to other transfer learning methods.

Software analytics builds quality prediction models for software projects. Experience shows that (a) the more projects studied, the more varied are the conclusions; and (b) project managers lose faith in the results of software analytics if those results keep changing. To reduce this conclusion instability, we propose the use of "bellwethers": given N projects from a community the bellwether is the project whose data yields the best predictions on all others. The bellwethers offer a way to mitigate conclusion instability because conclusions about a community are stable as long as this bellwether continues as the best oracle. Bellwethers are also simple to discover (just wrap a for-loop around standard data miners). When compared to other transfer learning methods (TCA+, transfer Naive Bayes, value cognitive boosting), using just the bellwether data to construct a simple transfer learner yields comparable predictions. Further, bellwethers appear in many SE tasks such as defect prediction, effort estimation, and bad smell detection. We hence recommend using bellwethers as a baseline method for transfer learning against which future work should be compared

View on arXiv PDF

Similar