AINENov 30, 2024

Aligning LLM+PDDL Symbolic Plans with Human Objective Specifications through Evolutionary Algorithm Guidance

arXiv:2412.00300v21 citationsh-index: 10Has Code2025 IEEE 21st International Conference on Automation Science and Engineering (CASE)
Originality Incremental advance
AI Analysis

This work addresses the challenge of making automated planning tools more accessible to non-experts by enhancing the accuracy of natural language goal specifications, though it is incremental in nature.

The paper tackles the problem of aligning symbolic plans generated by LLM+PDDL systems with human natural language specifications, using an evolutionary algorithm to refine imprecise translations and improve plan adherence, demonstrating improved results in a naval disaster recovery task.

Automated planning using a symbolic planning language, such as PDDL, is a general approach to producing optimal plans to achieve a stated goal. However, creating suitable machine understandable descriptions of the planning domain, problem, and goal requires expertise in the planning language, limiting the utility of these tools for non-expert humans. Recent efforts have explored utilizing a symbolic planner in conjunction with a large language model to generate plans from natural language descriptions given by a non-expert human (LLM+PDDL). Our approach performs initial translation of goal specifications to a set of PDDL goal constraints using an LLM; such translations often result in imprecise symbolic specifications, which are difficult to validate directly. We account for this using an evolutionary approach to generate a population of symbolic goal specifications with slight differences from the initial translation, and utilize a trained LSTM-based validation model to assess whether each induced plan in the population adheres to the natural language specifications. We evaluate our approach on a collection of prototypical specifications in a notional naval disaster recovery task, and demonstrate that our evolutionary approach improve adherence of generated plans to natural language specifications when compared to plans generated using only LLM translations. The code for our method can be found at https://github.com/owenonline/PlanCritic.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes