AI SEApr 25, 2023

AI-assisted coding: Experiments with GPT-4

Russell A Poldrack, Thomas Lu, Gašper Beguš

arXiv:2304.13187v125.1107 citationsh-index: 22Has Code

Originality Synthesis-oriented

AI Analysis

This addresses the reliability of AI-assisted coding tools for developers, but it is incremental as it builds on existing GPT-4 capabilities.

The paper investigates GPT-4's ability to generate and refactor code, finding that it improves code quality metrics but requires human validation for accuracy, as many generated tests fail.

Artificial intelligence (AI) tools based on large language models have acheived human-level performance on some computer programming tasks. We report several experiments using GPT-4 to generate computer code. These experiments demonstrate that AI code generation using the current generation of tools, while powerful, requires substantial human validation to ensure accurate performance. We also demonstrate that GPT-4 refactoring of existing code can significantly improve that code along several established metrics for code quality, and we show that GPT-4 can generate tests with substantial coverage, but that many of the tests fail when applied to the associated code. These findings suggest that while AI coding tools are very powerful, they still require humans in the loop to ensure validity and accuracy of the results.

View on arXiv PDF Code

Similar