CLAIJun 14, 2023

WizardCoder: Empowering Code Large Language Models with Evol-Instruct

Microsoft
arXiv:2306.08568v2966 citationsh-index: 41Has Code
Originality Highly original
AI Analysis

This addresses the problem of improving code generation performance for developers and researchers, representing a strong incremental advance in fine-tuning methods for Code LLMs.

The paper tackles the lack of instruction fine-tuning in Code LLMs by introducing WizardCoder, which uses Evol-Instruct for complex instruction fine-tuning, achieving state-of-the-art results on benchmarks like HumanEval and HumanEval+.

Code Large Language Models (Code LLMs), such as StarCoder, have demonstrated exceptional performance in code-related tasks. However, most existing models are solely pre-trained on extensive raw code data without instruction fine-tuning. In this paper, we introduce WizardCoder, which empowers Code LLMs with complex instruction fine-tuning, by adapting the Evol-Instruct method to the domain of code. Through comprehensive experiments on four prominent code generation benchmarks, namely HumanEval, HumanEval+, MBPP, and DS-1000, we unveil the exceptional capabilities of our model. It surpasses all other open-source Code LLMs by a substantial margin. Moreover, our model even outperforms the largest closed LLMs, Anthropic's Claude and Google's Bard, on HumanEval and HumanEval+. Our code, model weights, and data are public at https://github.com/nlpxucan/WizardLM

Code Implementations4 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes