SEMar 17, 2017

Bellwethers: A Baseline Method For Transfer Learning

arXiv:1703.06218v482 citations
Originality Incremental advance
AI Analysis

This provides a simple baseline method for transfer learning in software engineering tasks like defect prediction, though it is incremental as it builds on existing data mining techniques.

The paper tackles the problem of conclusion instability in software analytics by proposing 'bellwethers', a project whose data yields the best predictions across others, which reduces instability and yields comparable predictions to other transfer learning methods.

Software analytics builds quality prediction models for software projects. Experience shows that (a) the more projects studied, the more varied are the conclusions; and (b) project managers lose faith in the results of software analytics if those results keep changing. To reduce this conclusion instability, we propose the use of "bellwethers": given N projects from a community the bellwether is the project whose data yields the best predictions on all others. The bellwethers offer a way to mitigate conclusion instability because conclusions about a community are stable as long as this bellwether continues as the best oracle. Bellwethers are also simple to discover (just wrap a for-loop around standard data miners). When compared to other transfer learning methods (TCA+, transfer Naive Bayes, value cognitive boosting), using just the bellwether data to construct a simple transfer learner yields comparable predictions. Further, bellwethers appear in many SE tasks such as defect prediction, effort estimation, and bad smell detection. We hence recommend using bellwethers as a baseline method for transfer learning against which future work should be compared

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes