CLAIMay 25, 2022

NaturalProver: Grounded Mathematical Proof Generation with Language Models

UW
arXiv:2205.12910v299 citationsh-index: 111
Originality Incremental advance
AI Analysis

This addresses the problem of automated mathematical reasoning for AI research and education, representing an incremental advance in applying language models to this domain.

The authors tackled theorem proving in natural mathematical language by developing NaturalProver, a language model that generates proofs using background references, which improved proof quality over fine-tuned GPT-3 and achieved over 40% correct and useful next-step suggestions.

Theorem proving in natural mathematical language - the mixture of symbolic and natural language used by humans - plays a central role in mathematical advances and education, and tests aspects of reasoning that are core to intelligence. Yet it has remained underexplored with modern generative models. We study large-scale language models on two new generation tasks: suggesting the next step in a mathematical proof, and full proof generation. We develop NaturalProver, a language model that generates proofs by conditioning on background references (e.g. theorems and definitions that are either retrieved or human-provided), and optionally enforces their presence with constrained decoding. On theorems from the NaturalProofs benchmark, NaturalProver improves the quality of next-step suggestions and generated proofs over fine-tuned GPT-3, according to human evaluations from university-level mathematics students. NaturalProver is capable of proving some theorems that require short (2-6 step) proofs, and providing next-step suggestions that are rated as correct and useful over 40% of the time, which is to our knowledge the first demonstration of these capabilities using neural language models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes