CRSEMay 11

Usability as a Weapon: Attacking the Safety of LLM-Based Code Generation via Usability Requirements

arXiv:2605.1013389.3
Predicted impact top 7% in CR · last 90 daysOriginality Highly original
AI Analysis

For developers and security practitioners relying on LLM-based code generation, this reveals a practical attack surface where explicit usability goals can silently undermine security.

The paper introduces UPAttack, a threat where usability requirements cause LLMs to drop security constraints, and proposes U-SPLOIT to automate such attacks, achieving up to 98.1% success rate on state-of-the-art models across multiple languages.

Large Language Models (LLMs) are increasingly used for automated software development, making their ability to preserve secure coding practices critical. In practice, however, many security requirements are implicit or underspecified, whereas usability requirements are explicit and high-signal. This asymmetry motivates our investigation of usability pressure as a practical attack surface: realistic usability-oriented requirements (e.g., new features, performance constraints, or simplicity demands) can cause coding LLMs to satisfy explicit usability goals while silently dropping implicit security constraints -- a form of reward hacking. We formalize this threat as UPAttack and propose U-SPLOIT, an automated framework to craft UPAttack that (i) selects tasks where a model is initially secure, (ii) synthesizes usability pressures by identifying usability rewards of insecure alternatives across three vectors (Functionality, Implementation, Trade-off), and (iii) verifies security regression via both existing test cases and dynamically generated exploit payloads. Across 75 seed scenarios (25 CWEs x 3 cases), spanning multiple languages (Python, C, and JavaScript), U-SPLOIT achieves attack success rates up to 98.1% on multiple state-of-the-art models (e.g., GPT-5.2-chat and Gemini-3-Flash-Preview).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes