SEAICLFeb 22, 2024

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

arXiv:2402.14658v3257 citationsh-index: 28Has CodeACL
Originality Incremental advance
AI Analysis

This work addresses the problem of limited execution and refinement in open-source code generation models for developers and researchers, representing a significant but incremental improvement over existing methods.

The paper tackles the gap in execution and iterative refinement capabilities between open-source code generation models and proprietary systems like GPT-4 Code Interpreter by introducing OpenCodeInterpreter, a family of open-source systems that integrate code generation with execution and refinement, achieving up to 91.6% accuracy on benchmarks with feedback.

The introduction of large language models has significantly advanced code generation. However, open-source models often lack the execution capabilities and iterative refinement of advanced systems like the GPT-4 Code Interpreter. To address this, we introduce OpenCodeInterpreter, a family of open-source code systems designed for generating, executing, and iteratively refining code. Supported by Code-Feedback, a dataset featuring 68K multi-turn interactions, OpenCodeInterpreter integrates execution and human feedback for dynamic code refinement. Our comprehensive evaluation of OpenCodeInterpreter across key benchmarks such as HumanEval, MBPP, and their enhanced versions from EvalPlus reveals its exceptional performance. Notably, OpenCodeInterpreter-33B achieves an accuracy of 83.2 (76.4) on the average (and plus versions) of HumanEval and MBPP, closely rivaling GPT-4's 84.2 (76.2) and further elevates to 91.6 (84.6) with synthesized human feedback from GPT-4. OpenCodeInterpreter brings the gap between open-source code generation models and proprietary systems like GPT-4 Code Interpreter.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes