AISEApr 25, 2023

AI-assisted coding: Experiments with GPT-4

arXiv:2304.13187v1107 citationsh-index: 22
Originality Synthesis-oriented
AI Analysis

This addresses the reliability of AI-assisted coding tools for developers, but it is incremental as it builds on existing GPT-4 capabilities.

The paper investigates GPT-4's ability to generate and refactor code, finding that it improves code quality metrics but requires human validation for accuracy, as many generated tests fail.

Artificial intelligence (AI) tools based on large language models have acheived human-level performance on some computer programming tasks. We report several experiments using GPT-4 to generate computer code. These experiments demonstrate that AI code generation using the current generation of tools, while powerful, requires substantial human validation to ensure accurate performance. We also demonstrate that GPT-4 refactoring of existing code can significantly improve that code along several established metrics for code quality, and we show that GPT-4 can generate tests with substantial coverage, but that many of the tests fail when applied to the associated code. These findings suggest that while AI coding tools are very powerful, they still require humans in the loop to ensure validity and accuracy of the results.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes