CRLGApr 20

A Quasi-Experimental Developer Study of Security Training in LLM-Assisted Web Application Development

arXiv:2604.1776368.3h-index: 4
Predicted impact top 22% in CR · last 90 daysOriginality Synthesis-oriented
AI Analysis

For developers using LLM-assisted coding, this study shows that targeted security training can improve code security without modifying the model, though it is incremental and does not replace existing security practices.

A quasi-experimental study found that a layer-based security training package significantly reduced validated security weaknesses in LLM-assisted Java Spring Boot development, with a 38.2% reduction in severity-weighted burden and a 79.2% reduction in critical findings.

This paper presents a controlled quasi-experimental developer study examining whether a layer-based security training package is associated with improved security quality in LLM-assisted implementation of an identity-centric Java Spring Boot backend. The study uses a mixed design with a within-subject pre-training versus post-training comparison and an exploratory between-subject expertise factor. Twelve developers completed matched runs under a common interface, fixed model configuration, counterbalanced task sets, and a shared starter project. Security outcomes were assessed via independent manual validation of submitted repositories by the first and second authors. The primary participant-level endpoint was a severity-weighted validated-weakness score. The post-training condition showed a significant paired reduction under an exact Wilcoxon signed-rank test ($p = 0.0059$). In aggregate, validated weaknesses decreased from 162 to 111 (31.5\%), the severity-weighted burden decreased from 432 to 267 (38.2\%), and critical findings decreased from 24 to 5 (79.2\%). The largest reductions were in authorization and object access (53.3\%) and in authentication, credential policy, and recovery weaknesses (44.7\%). Session and browser trust-boundary issues showed minimal change, while sensitive-data and cryptographic weaknesses showed only marginal improvement. These results suggest that, under the tested conditions, post-training runs reduce validated security burden in LLM-assisted backend development without modifying the model. They do not support replacing secure defaults, static analysis, expert review, or operational hardening.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes