AIMay 5

Self-Improvement for Fast, High-Quality Plan Generation

arXiv:2605.0362540.4
Predicted impact top 77% in AI · last 90 daysOriginality Incremental advance
AI Analysis

For AI planning researchers, this work provides a scalable method to generate high-quality plans faster than traditional symbolic planners.

This paper addresses the challenge of generating high-quality plans in sub-exponential time. Using self-improvement with generative models, they achieve a 30% reduction in plan length over symbolic planners, with over 80% of plans being optimal where known.

Generative models trained on synthetic plan data are a promising approach to generalized planning. Recent work has focused on finding any valid plan, rather than a high-quality solution. We address the challenge of producing high-quality plans, a computationally hard problem, in sub-exponential time. First, we demonstrate that, given optimal data, a decoder-only transformer can generate high-quality plans for unseen problem instances. Second, we show how to self-improve an initial model trained on sub-optimal data. Each round of self-improvement combines multiple model calls with graph search to generate improved plans, used for model fine-tuning. An experimental study on four domains: Blocksworld, Logistics, Labyrinth, and Sokoban, shows on average a 30% reduction in plan length over the source symbolic planner, with over 80% of plans being optimal, where the optimum is known. Plan quality is further improved by inference-time search. The model's latency scales sub-exponentially in contrast to the satisficing and optimal symbolic planners to which we compare. Together, these results suggest that self-improvement with generative models offers a scalable approach for high-quality plan generation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes