SE AI CRJan 9, 2025

Automating the Detection of Code Vulnerabilities by Analyzing GitHub Issues

Daniele Cipollone, Changjie Wang, Mariano Scazzariello, Simone Ferlin, Maliheh Izadi, Dejan Kostic, Marco Chiesa

arXiv:2501.05258v15.93 citationsh-index: 6Has Code2025 IEEE/ACM International Workshop on Large Language Models for Code (LLM4Code)

Originality Synthesis-oriented

AI Analysis

This addresses the problem of timely vulnerability detection for open-source software ecosystems, though it appears incremental as it applies existing methods to a new dataset.

The paper tackles automated detection of software vulnerabilities by analyzing GitHub issues using transformer-based models and machine learning, achieving results that demonstrate potential for real-world application in early detection to reduce exploitation windows.

In today's digital landscape, the importance of timely and accurate vulnerability detection has significantly increased. This paper presents a novel approach that leverages transformer-based models and machine learning techniques to automate the identification of software vulnerabilities by analyzing GitHub issues. We introduce a new dataset specifically designed for classifying GitHub issues relevant to vulnerability detection. We then examine various classification techniques to determine their effectiveness. The results demonstrate the potential of this approach for real-world application in early vulnerability detection, which could substantially reduce the window of exploitation for software vulnerabilities. This research makes a key contribution to the field by providing a scalable and computationally efficient framework for automated detection, enabling the prevention of compromised software usage before official notifications. This work has the potential to enhance the security of open-source software ecosystems.

View on arXiv PDF

Similar