CRLGJan 1

Improving LLM-Assisted Secure Code Generation through Retrieval-Augmented-Generation and Multi-Tool Feedback

arXiv:2601.00509v1h-index: 1
Originality Incremental advance
AI Analysis

This addresses security vulnerabilities in code generated by LLMs for developers, but it is incremental as it builds on prior work with structured feedback and retrieval augmentation.

The paper tackled the problem of LLMs generating insecure code by proposing a retrieval-augmented, multi-tool repair workflow, resulting in a 96% reduction in security vulnerabilities for DeepSeek-Coder-1.3B and a decrease from 58.55% to 22.19% in critical security defects for CodeLlama-7B.

Large Language Models (LLMs) can generate code but often introduce security vulnerabilities, logical inconsistencies, and compilation errors. Prior work demonstrates that LLMs benefit substantially from structured feedback, static analysis, retrieval augmentation, and execution-based refinement. We propose a retrieval-augmented, multi-tool repair workflow in which a single code-generating LLM iteratively refines its outputs using compiler diagnostics, CodeQL security scanning, and KLEE symbolic execution. A lightweight embedding model is used for semantic retrieval of previously successful repairs, providing security-focused examples that guide generation. Evaluated on a combined dataset of 3,242 programs generated by DeepSeek-Coder-1.3B and CodeLlama-7B, the system demonstrates significant improvements in robustness. For DeepSeek, security vulnerabilities were reduced by 96%. For the larger CodeLlama model, the critical security defect rate was decreased from 58.55% to 22.19%, highlighting the efficacy of tool-assisted self-repair even on "stubborn" models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes