SEAILOOct 25, 2024

CoqPilot, a plugin for LLM-based generation of proofs

arXiv:2410.19605v115 citationsh-index: 2Has CodeASE
Originality Synthesis-oriented
AI Analysis

This tool addresses the challenge of proof automation in Coq for users in formal verification, though it appears incremental as it builds on existing LLM and proof generation methods.

The authors tackled the problem of automating Coq proof writing by developing CoqPilot, a VS Code extension that uses LLMs and non-machine-learning methods to generate and verify proof candidates for holes in proofs, resulting in a tool that provides a zero-setup experience and includes a benchmarking system for experiments.

We present CoqPilot, a VS Code extension designed to help automate writing of Coq proofs. The plugin collects the parts of proofs marked with the admit tactic in a Coq file, i.e., proof holes, and combines LLMs along with non-machine-learning methods to generate proof candidates for the holes. Then, CoqPilot checks if each proof candidate solves the given subgoal and, if successful, replaces the hole with it. The focus of CoqPilot is twofold. Firstly, we want to allow users to seamlessly combine multiple Coq generation approaches and provide a zero-setup experience for our tool. Secondly, we want to deliver a platform for LLM-based experiments on Coq proof generation. We developed a benchmarking system for Coq generation methods, available in the plugin, and conducted an experiment using it, showcasing the framework's possibilities. Demo of CoqPilot is available at: https://youtu.be/oB1Lx-So9Lo. Code at: https://github.com/JetBrains-Research/coqpilot

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes