LLM-Guided Strategy Synthesis for Scalable Equality Saturation

Chenyun Yin, Youwei Xiao, Yuze Luo, Yuyang Zou, Yun Liang

arXiv:2604.1736443.5h-index: 4

Predicted impact top 79% in AI · last 90 daysOriginality Incremental advance

AI Analysis

It automates strategy design for e-graph-based compilers, a major obstacle to their adoption, by using LLMs to search for high-quality strategies.

EggMind is an LLM-guided framework for synthesizing reusable equality saturation strategies, reducing final cost by 45.1% and peak RAM by 69.1% on vectorization benchmarks compared to full EqSat.

Equality saturation (EqSat) is a powerful optimization paradigm that compactly represents many equivalent programs in an e-graph and delays commitment until extraction selects a lowest-cost program. Making EqSat effective, therefore, requires not only domain-specific rewrite rules but also domain-specific strategies. Today, much of this strategy design is still manual, making it a major obstacle to automating e-graph-based compilers. Recent rule-synthesis frameworks can automatically infer large rewrite vocabularies from semantic specifications, but they also enlarge the rewrite space and further exacerbate e-graph explosion. Although large language models (LLMs) make automated strategy synthesis plausible, directly evolving backend code remains ineffective in practice. The search lacks reusable strategy abstractions and actionable feedback, and can easily trigger e-graph explosion or converge to poor designs. We present EggMind, an LLM-guided, end-to-end framework for synthesizing reusable EqSat strategies. At its core, EggMind introduces a domain-specific language, EqSatL, to represent EqSat strategies as explicit and inspectable artifacts. It then proposes an LLM-guided agentic workflow, equipped with novel techniques including proof-derived rewrite motif caching and tractability guidance, to search efficiently for high-quality strategies while keeping synthesis stable under e-graph growth. Evaluation shows that EggMind substantially improves the resource-quality trade-off on vectorization benchmarks, reducing final cost by 45.1% and peak RAM by 69.1% relative to full EqSat. We further show that the same methodology transfers effectively to an XLA-based tensor compiler, and demonstrate its practical potential in a logic-synthesis case study with augmented rewrite spaces.

View on arXiv PDF

Similar