Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation
This addresses cheating issues for instructors in computer science education, though it is incremental as it builds on existing adversarial methods.
The paper tackled the problem of LLM-assisted cheating in introductory programming courses by testing adversarial perturbations on 5 widely used LLMs, resulting in a 77% reduction in average correctness score for code generation.
While Large language model (LLM)-based programming assistants such as CoPilot and ChatGPT can help improve the productivity of professional software developers, they can also facilitate cheating in introductory computer programming courses. Assuming instructors have limited control over the industrial-strength models, this paper investigates the baseline performance of 5 widely used LLMs on a collection of introductory programming problems, examines adversarial perturbations to degrade their performance, and describes the results of a user study aimed at understanding the efficacy of such perturbations in hindering actual code generation for introductory programming assignments. The user study suggests that i) perturbations combinedly reduced the average correctness score by 77%, ii) the drop in correctness caused by these perturbations was affected based on their detectability.