AIMar 23

FormalEvolve: Neuro-Symbolic Evolutionary Search for Diverse and Prover-Effective Autoformalization

arXiv:2603.1982839.11 citationsh-index: 7
AI Analysis

This work addresses the challenge of improving proof-search efficiency in autoformalization for mathematical reasoning and theorem proving, representing an incremental advance with specific performance gains.

The paper tackled the problem of autoformalization by developing FormalEvolve, a neuro-symbolic evolutionary search framework that generates diverse and prover-effective formalizations, achieving semantic hit rates of 58.0% on CombiBench and 84.9% on ProofNet under a budget of 100 calls.

Autoformalization aims to translate natural-language mathematics into compilable, machine-checkable statements. However, semantic consistency does not imply prover effectiveness: even semantically consistent formalizations can differ substantially in proof-search cost and success rate. In this work, we formulate autoformalization as a budgeted, test-time search for semantically consistent repertoires, and propose FormalEvolve, a compilation-gated neuro-symbolic evolutionary framework. FormalEvolve generates diverse candidates via LLM-driven mutation and crossover with bounded patch repair, while symbolic Abstract Syntax Tree (AST) rewrite operations further inject structural diversity. On CombiBench and ProofNet, under a strict generator-call budget of T = 100, FormalEvolve reaches semantic hit rates (SH@100) of 58.0% and 84.9%, and reduces cross-problem concentration of semantic successes(lower Gini). Under a fixed prover budget, FormalEvolve also improves downstream proving performance on CombiBench. Code will be released publicly.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes