SE AI PL · May 13, 2025

AI-Mediated Code Comment Improvement

Maria Dhakal, Chia-Yi Su, Robert Wallace, Chris Fakhimi, Aakash Bansal, Toby Li, Yu Huang, Collin McMillan

arXiv:2505.09021v13.4 · h-index: 12

Originality: Synthesis-oriented

AI Analysis

The paper addresses code readability and maintenance issues for software developers, though its contribution is incremental: it applies existing LLM methods to a specific domain.

The paper tackles the problem of improving code comment quality by developing an AI-based rewriting procedure using GPT-4o and a distilled model, showing improvements along empirically determined quality axes.

This paper describes an approach to improve code comments along different quality axes by rewriting those comments with customized Artificial Intelligence (AI)-based tools. We conduct an empirical study followed by a grounded-theory qualitative analysis to determine the quality axes to improve. Then we propose a procedure using a Large Language Model (LLM) to rewrite existing code comments along the quality axes. We implement our procedure using GPT-4o, then distil the results into a smaller model capable of being run in-house, so users can maintain data custody. We evaluate both the GPT-4o implementation and the distilled model, and show how our procedure improves code comments along the quality axes. We release all data and source code in an online repository for reproducibility.
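As a rough illustration of the rewriting step described above, the sketch below assembles a prompt that asks a model to rewrite a comment along a set of quality axes, and packages the resulting (prompt, rewrite) pair as a record that could be used to distil a smaller model. Everything here is an assumption for illustration: the axis names, the prompt wording, and the helper names `QualityAxis`, `build_rewrite_prompt`, and `to_distillation_record` are hypothetical and are not taken from the paper, whose actual axes come from its empirical study. No model is called; the teacher output is supplied by the caller.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class QualityAxis:
    """One dimension along which a comment should improve (hypothetical)."""
    name: str
    instruction: str


# Placeholder axes for illustration only; the paper derives its own axes
# from an empirical study, and they are not listed in this summary.
EXAMPLE_AXES = [
    QualityAxis("completeness", "Mention parameters and the return value."),
    QualityAxis("conciseness", "Remove words that add no information."),
]


def build_rewrite_prompt(code: str, comment: str, axes: list[QualityAxis]) -> str:
    """Assemble a prompt asking a model to rewrite `comment` along `axes`."""
    axis_lines = "\n".join(f"- {a.name}: {a.instruction}" for a in axes)
    return (
        "Rewrite the code comment below so that it improves along each axis.\n"
        f"Axes:\n{axis_lines}\n\n"
        f"Code:\n{code}\n\n"
        f"Original comment:\n{comment}\n\n"
        "Return only the rewritten comment."
    )


def to_distillation_record(prompt: str, teacher_output: str) -> str:
    """Serialize one (prompt, teacher rewrite) pair as a JSON line.

    Pairs like this could serve as fine-tuning data for a smaller model.
    """
    return json.dumps({"prompt": prompt, "completion": teacher_output})
```

In this sketch the prompt would be sent to the larger model, and its rewritten comment passed to `to_distillation_record` to accumulate training data for the in-house model.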
