CLAIPLMar 10, 2022

Compilable Neural Code Generation with Compiler Feedback

arXiv:2203.05132v1660 citationsh-index: 33
Originality Incremental advance
AI Analysis

This addresses the challenge of ensuring compilability in automated code generation for software engineering and computational linguistics, representing a strong incremental improvement over existing approaches.

The paper tackled the problem of generating compilable code by proposing COMPCODER, a three-stage pipeline that uses compiler feedback, which improved compilation success rates from 44.18% to 89.18% in code completion and from 70.3% to 96.2% in text-to-code generation compared to state-of-the-art methods.

Automatically generating compilable programs with (or without) natural language descriptions has always been a touchstone problem for computational linguistics and automated software engineering. Existing deep-learning approaches model code generation as text generation, either constrained by grammar structures in decoder, or driven by pre-trained language models on large-scale code corpus (e.g., CodeGPT, PLBART, and CodeT5). However, few of them account for compilability of the generated programs. To improve compilability of the generated programs, this paper proposes COMPCODER, a three-stage pipeline utilizing compiler feedback for compilable code generation, including language model fine-tuning, compilability reinforcement, and compilability discrimination. Comprehensive experiments on two code generation tasks demonstrate the effectiveness of our proposed approach, improving the success rate of compilation from 44.18 to 89.18 in code completion on average and from 70.3 to 96.2 in text-to-code generation, respectively, when comparing with the state-of-the-art CodeGPT.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes