CLAIMay 9, 2023

Distilling Script Knowledge from Large Language Models for Constrained Language Planning

arXiv:2305.05252v5245 citations
Originality Incremental advance
AI Analysis

This work addresses a gap in AI planning for specific, constrained goals, which is incremental as it builds on prior language model-based planning but focuses on understudied multi-facet constraints.

The paper tackles the problem of constrained language planning, where goals have multi-facet constraints (e.g., 'make a cake for diabetics'), by proposing an overgenerate-then-filter approach to improve large language models (LLMs) and distilling a dataset of 55,000 scripts. The method significantly enhances LLMs' ability in this task, particularly in constraint faithfulness, and the dataset effectively enables smaller LMs to perform constrained language planning.

In everyday life, humans often plan their actions by following step-by-step instructions in the form of goal-oriented scripts. Previous work has exploited language models (LMs) to plan for abstract goals of stereotypical activities (e.g., "make a cake"), but leaves more specific goals with multi-facet constraints understudied (e.g., "make a cake for diabetics"). In this paper, we define the task of constrained language planning for the first time. We propose an overgenerate-then-filter approach to improve large language models (LLMs) on this task, and use it to distill a novel constrained language planning dataset, CoScript, which consists of 55,000 scripts. Empirical results demonstrate that our method significantly improves the constrained language planning ability of LLMs, especially on constraint faithfulness. Furthermore, CoScript is demonstrated to be quite effective in endowing smaller LMs with constrained language planning ability.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes