SELGDec 25, 2023

RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program Repair

arXiv:2312.15698v6105 citationsh-index: 51IEEE Trans Softw Eng
Originality Incremental advance
AI Analysis

This addresses the challenge of scaling program repair with frontier models for developers, though it appears incremental by building on existing fine-tuning and PEFT techniques.

The paper tackled the problem of fine-tuning large language models for automated program repair by proposing RepairLLaMA, which identifies optimal code representations and uses parameter-efficient fine-tuning, resulting in correctly fixing 144 Defects4J v2, 109 HumanEval-Java, and 20 GitBug-Java bugs and outperforming all baselines.

Automated Program Repair (APR) has evolved significantly with the advent of Large Language Models (LLMs). Fine-tuning LLMs for program repair is a recent avenue of research, with many dimensions which have not been explored. Existing work mostly fine-tune LLMs with naive code representations and does not scale to frontier models. To address this problem, we propose RepairLLaMA, a novel program repair approach that 1) identifies optimal code representations for APR with fine-tuned models, and 2) pioneers state-of-the-art parameter-efficient fine-tuning technique (PEFT) for program repair. This results in RepairLLaMA producing a highly effective `program repair adapter' for fixing bugs with AI. Our experiments demonstrate the validity of both concepts. First, fine-tuning adapters with program repair specific code representations enables the model to use meaningful repair signals and produce better patches. Second, parameter-efficient fine-tuning helps fine-tuning to converge and clearly contributes to the effectiveness of RepairLLaMA in fixing bugs outside the fine-tuning data distribution. Overall, RepairLLaMA correctly fixes 144 Defects4J v2, 109 HumanEval-Java, and 20 GitBug-Java bugs, outperforming all baselines.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes