AIMay 28

LLM-Evolved Domain-Independent Heuristics for Symbolic AI Planning

arXiv:2605.2964919.7
Predicted impact top 56% in AI · last 90 daysOriginality Highly original
AI Analysis

For the AI planning community, this work demonstrates that LLMs can evolve domain-independent heuristics that exceed decades of human-designed heuristics, offering a new automated approach to heuristic design.

This paper introduces the first LLM-generated domain-independent heuristics for symbolic AI planning that outperform hand-engineered state-of-the-art heuristics, solving more tasks on unseen domains and spanning the Pareto frontier of the informedness-speed tradeoff.

Heuristic search is the dominant paradigm in symbolic AI planning, and the strongest heuristics are the result of decades of work by planning researchers. Recent work has shown that large language models (LLMs) can design heuristics for individual planning domains, but no LLM-generated heuristic has so far worked on arbitrary planning tasks. In this paper, we use evolutionary search to produce the first LLM-generated domain-independent heuristics that exceed the hand-engineered state of the art. We let an LLM mutate parent heuristics written in C++, store candidates in a MAP-Elites archive keyed on informedness and speed and calculate fitness scores by blending coverage with solving time. To place the evolved programs in context, we additionally benchmark a broad set of hand-engineered heuristics on their informedness-speed tradeoff, which to our knowledge has not been done before. On unseen testing domains, our best evolved heuristic solves more tasks than even the strongest baseline, with our full heuristic suite spanning the Pareto frontier of said tradeoff. We also find that seeding evolution from the trivial blind heuristic outperforms seeding from the strong FF heuristic, even when the resulting program is itself an FF variant, and that LLM reasoning effort affects how often candidates compile much more than the quality of those that do. Because the evolved programs are plain C++, they slot into existing planners as drop-in replacements and inherit the soundness and completeness guarantees of the underlying search.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes