CRSEMar 29

How Can ChatGPT Support Human Security Testers to Help Mitigate Supply Chain Attacks?

arXiv:2310.0071079.363 citationsh-index: 21
AI Analysis

For developers and security testers, this work provides a practical method to generate concrete exploit evidence for vulnerable dependencies, addressing distrust in automated vulnerability reports.

The study explores using ChatGPT-4.0 to generate security tests that demonstrate how vulnerable library dependencies enable supply chain attacks. ChatGPT generated evidence for 24 out of 49 apps, outperforming existing tools by generating more tests and achieving more attacks, with four CVEs assigned.

Developers often build software on top of third-party libraries (Libs) to improve productivity, but these libraries may contain vulnerabilities that enable supply chain attacks. Existing tools detect vulnerable dependencies, yet developers often distrust their reports without concrete exploit evidence. Manually crafting such demonstrations is costly, and tool support is lacking. To help developers enhance software security, in this study, we systematically explored the usage of a large language model (LLM) --ChatGPT-4.0--to generate security tests, which unit tests demonstrate how vulnerable library dependencies facilitate the supply chain attacks to given Apps. In our exploration, we defined prompt templates to take in the various vulnerability-relevant information we manually collected, and generated prompts from those templates to query ChatGPT for security test generation. We found that ChatGPT-generated tests demonstrated 24 pieces of evidence or proof of vulnerability for 49 Apps. To assess the consistency of test generation, we also evaluated another five state-of-the-art LLMs. All the models generated security tests for at least 17 cases that successfully demonstrate the vulnerabilities. We filed six reports for the newly revealed vulnerabilities in Apps, and got four Common Vulnerability Entries (CVEs) assigned. Our use of ChatGPT outperformed two state-of-the-art security test generators (TRANSFER and SIEGE), by generating a lot more tests and achieving more attacks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes