SE LGMay 31, 2023

Better patching using LLM prompting, via Self-Consistency

arXiv:2306.00108v220.262 citations

Originality Incremental advance

AI Analysis

This addresses the lack of explanations in software datasets for software engineers, though it is incremental as it adapts an existing method to a new domain.

The paper tackles the problem of applying self-consistency prompting to program repair by using commit logs as explanations in few-shot examples, achieving state-of-the-art results on the MODIT dataset.

Large Language models (LLMs) can be induced to solve non-trivial problems with "few-shot" prompts including illustrative problem-solution examples. Now if the few-shots also include "chain of thought" (CoT) explanations, which are of the form problem-explanation-solution, LLMs will generate a "explained" solution, and perform even better. Recently an exciting, substantially better technique, self-consistency [1] (S-C) has emerged, based on the intuition that there are many plausible explanations for the right solution; when the LLM is sampled repeatedly to generate a pool of explanation-solution pairs, for a given problem, the most frequently occurring solutions in the pool (ignoring the explanations) tend to be even more likely to be correct! Unfortunately, the use of this highly-performant S-C (or even CoT) approach in software engineering settings is hampered by the lack of explanations; most software datasets lack explanations. In this paper, we describe an application of the S-C approach to program repair, using the commit log on the fix as the explanation, only in the illustrative few-shots. We achieve state-of-the art results, beating previous approaches to prompting-based program repair, on the MODIT dataset; we also find evidence suggesting that the correct commit messages are helping the LLM learn to produce better patches.

View on arXiv PDF

Similar