AI LOOct 14, 2024

3D-Prover: Diversity Driven Theorem Proving With Determinantal Point Processes

Sean Lamont, Christian Walder, Amir Dezfouli, Paul Montague, Michael Norrish

arXiv:2410.11133v25.83 citationsh-index: 1Has Code

Originality Incremental advance

AI Analysis

This addresses the problem of inefficient proof search for automated reasoning systems, though it is incremental as it augments existing tactic generators rather than introducing a new paradigm.

The paper tackles the intractable search space in automated theorem proving by pruning semantically similar or error-prone tactics, using synthetic data and Determinantal Point Processes to select diverse, high-quality tactics, resulting in increased proof rates and improved tactic success rates, execution times, and diversity on benchmarks like miniF2F and LeanDojo.

A key challenge in automated formal reasoning is the intractable search space, which grows exponentially with the depth of the proof. This branching is caused by the large number of candidate proof tactics which can be applied to a given goal. Nonetheless, many of these tactics are semantically similar or lead to an execution error, wasting valuable resources in both cases. We address the problem of effectively pruning this search, using only synthetic data generated from previous proof attempts. We first demonstrate that it is possible to generate semantically aware tactic representations which capture the effect on the proving environment, likelihood of success, and execution time. We then propose a novel filtering mechanism which leverages these representations to select semantically diverse and high quality tactics, using Determinantal Point Processes. Our approach, 3D- Prover, is designed to be general, and to augment any underlying tactic generator. We demonstrate the effectiveness of 3D-Prover on the miniF2F and LeanDojo benchmarks by augmenting popular open source proving LLMs. We show that our approach leads to an increase in the overall proof rate, as well as a significant improvement in the tactic success rate, execution time and diversity. We make our code available at https://github.com/sean-lamont/3D-Prover.

View on arXiv PDF Code

Similar