SEMay 27

Confident Learning-based Network for Detecting Bug-Inducing Commits on SZZ with Noisy Labels

arXiv:2605.2788013.2Has Code
Predicted impact top 87% in SE · last 90 daysOriginality Incremental advance
AI Analysis

For software engineering teams, this work improves the accuracy of identifying bug-inducing commits, but the approach is incremental, combining existing techniques (confident learning, graph convolutional networks) for a known bottleneck.

The paper proposes BIC-Hunter, a model for detecting bug-inducing commits that addresses data noise and semantic gaps in JIT defect prediction. It achieves state-of-the-art results, outperforming existing methods by 6.16%, 7.13%, and 5.53% on Recall@1, Recall@2, and Recall@3, respectively, with MFR improvements of 8.43% to 32.82%.

The Just-In-Time (JIT) defect prediction model serves as a critical tool for ensuring the quality of software development and enhancing software performance. It assists development teams in promptly identifying and addressing potential issues by predicting whether code submissions may introduce defects. However, due to the existence of data noise and insufficient semantic connections in real-world scenarios, existing approaches face challenges in accurately identifying the code commits that introduce defects and capturing the potential semantic relationships. To address these challenges, we propose the BIC- Hunter(Bug-Inducing Commits Hunter) model, which mitigates data noise and improves semantic understanding, thereby enhancing the accuracy of bug-inducing commit identification. BIC - Hunter model consists of two components: a data denoising component and a semantic relationship capturing component. Specifically, the data denoising component addresses the challenges posed by inaccurate annotations and inconsistencies in real-world data, enhancing the reliability of training data and improving overall model robustness. The semantic relation- ship capturing component constructs homogeneous graphs and applies graph convolutional networks to facilitate a more comprehensive analysis of code context, enabling the identification of defects caused by code commits and enhancing the confidence in pinpointing their root causes. Experimental studies on a large-scale dataset integrated from three open-source datasets show that BIC- Hunter exhibits outstanding performance. BIC- Hunter outperforms the state-of-the-art by 6.16%, 7.13%, and 5.53% on Recall@1, Recall@2, and Recall@3, respectively, while the MFR index increases by 8.43% to 32.82%. These results demonstrate the superior capability of our method in identifying bug-inducing commits.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes