AIMar 23

FormalEvolve: Neuro-Symbolic Evolutionary Search for Diverse and Prover-Effective Autoformalization

arXiv:2603.1982839.11 citationsh-index: 7

AI Analysis

This work addresses the challenge of improving proof-search efficiency in autoformalization for mathematical reasoning and theorem proving, representing an incremental advance with specific performance gains.

The paper tackled the problem of autoformalization by developing FormalEvolve, a neuro-symbolic evolutionary search framework that generates diverse and prover-effective formalizations, achieving semantic hit rates of 58.0% on CombiBench and 84.9% on ProofNet under a budget of 100 calls.

Autoformalization aims to translate natural-language mathematics into compilable, machine-checkable statements. However, semantic consistency does not imply prover effectiveness: even semantically consistent formalizations can differ substantially in proof-search cost and success rate. In this work, we formulate autoformalization as a budgeted, test-time search for semantically consistent repertoires, and propose FormalEvolve, a compilation-gated neuro-symbolic evolutionary framework. FormalEvolve generates diverse candidates via LLM-driven mutation and crossover with bounded patch repair, while symbolic Abstract Syntax Tree (AST) rewrite operations further inject structural diversity. On CombiBench and ProofNet, under a strict generator-call budget of T = 100, FormalEvolve reaches semantic hit rates (SH@100) of 58.0% and 84.9%, and reduces cross-problem concentration of semantic successes(lower Gini). Under a fixed prover budget, FormalEvolve also improves downstream proving performance on CombiBench. Code will be released publicly.

View on arXiv PDF

Similar