Enhancing Reliability in LLM-Based Secure Code Generation

Mohammed F. Kharma, Mohammad Alkhanafseh, Ahmed Sabbah, David Mohaisen

arXiv:2605.2430070.8

Predicted impact top 20% in CR · last 90 daysOriginality Incremental advance

AI Analysis

For developers using LLMs for code generation, MA-CoT provides a prompting strategy that consistently improves security reliability, unlike existing methods that may increase vulnerabilities.

The paper introduces MA-CoT, a framework that embeds CWE mitigation guidance and language-aware safeguards into chain-of-thought prompting to reduce security vulnerabilities in LLM-generated code. Across multiple LLMs and languages, MA-CoT reduces total security findings by 57.6% on a primary dataset and 94.5% on LLMSecEval, with high-severity findings dropping by 56.7% and 95.6% respectively.

Large language models (LLMs) are widely used for code generation, but their security reliability remains inconsistent across languages and prompting strategies. Existing prompt engineering improves functional correctness but rarely ensures consistent security outcomes. We introduce the \textit{Mitigation-Aware Chain-of-Thought (MA-CoT)} framework, which embeds task-specific CWE mitigation guidance and language-aware safeguards to reduce recurring vulnerabilities in generated code. We evaluate MA-CoT across three LLMs (gpt-5, claude-4.5, gemini-2.5), three programming languages (C, Java, Python), and four prompting strategies (Vanilla, Zero-shot, CoT, MA-CoT) on a 200-task primary dataset, with external validation on LLMSecEval. Using static analysis with expert validation, MA-CoT reduces total security findings from 92 to 39 (57.6\%) on the primary dataset and from 73 to 4 (94.5\%) on LLMSecEval. High-severity findings (Blocker + Critical) drop from 90 to 39 (56.7\%) and from 45 to 2 (95.6\%), respectively. Across both datasets, MA-CoT is the only strategy that consistently improves security reliability; Zero-shot and CoT are less reliable and may increase vulnerability, especially in C. We further introduce a strict layered attribution of vulnerability drivers (language-core vs. stack layers) and show that residual risk concentrates in hardening-oriented patterns (e.g., OS- and toolchain-dependent), motivating secure-by-construction primitives alongside prompting.

View on arXiv PDF

Similar