SEApr 8

Program Analysis Guided LLM Agent for Proof-of-Concept Generation

arXiv:2604.0762468.0
Predicted impact top 28% in SE · last 90 daysOriginality Incremental advance
AI Analysis

This addresses the problem for software developers needing reliable vulnerability reproduction, representing a strong incremental advance in automated PoC generation.

The paper tackles the challenge of automatically generating proof-of-concept inputs for software vulnerabilities by introducing PAGENT, a hybrid approach that integrates static and dynamic analysis guidance with an LLM agent, resulting in a 132% improvement over prior methods.

Software developers frequently receive vulnerability reports that require them to reproduce the vulnerability in a reliable manner by generating a proof-of-concept (PoC) input that triggers it. Given the source code for a software project and a specific code location for a potential vulnerability, automatically generating a PoC for the given vulnerability has been a challenging research problem. Symbolic execution and fuzzing techniques require expert guidance and manual steps and face scalability challenges for PoC generation. Although recent advances in LLMs have increased the level of automation and scalability, the success rate of PoC generation with LLMs remains quite low. In this paper, we present a novel approach called Program Analysis Guided proof of concept generation agENT (PAGENT) that is scalable and significantly improves the success rate of automated PoC generation compared to prior results. PAGENT integrates lightweight and rule-based static analysis phases for providing static analysis guidance and sanitizer-based profiling and coverage information for providing dynamic analysis guidance with a PoC generation agent. Our experiments demonstrate that the resulting hybrid approach significantly outperforms the prior top-performing agentic approach by 132% for the PoC generation task.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes