A Computational Interface to Translate Strategic Intent from Unstructured Language in a Low-Data Setting
This addresses the challenge for autonomous systems to operate independently according to user plans without frequent guidance, though it appears incremental as it builds on prior work on language specification.
The paper tackles the problem of interpreting high-level strategic intent from unstructured language in mixed-initiative human-AI tasks, and shows that their model significantly outperforms human interpreters and ChatGPT in inferring goals and constraints in a low-data setting.
Many real-world tasks involve a mixed-initiative setup, wherein humans and AI systems collaboratively perform a task. While significant work has been conducted towards enabling humans to specify, through language, exactly how an agent should complete a task (i.e., low-level specification), prior work lacks on interpreting the high-level strategic intent of the human commanders. Parsing strategic intent from language will allow autonomous systems to independently operate according to the user's plan without frequent guidance or instruction. In this paper, we build a computational interface capable of translating unstructured language strategies into actionable intent in the form of goals and constraints. Leveraging a game environment, we collect a dataset of over 1000 examples, mapping language strategies to the corresponding goals and constraints, and show that our model, trained on this dataset, significantly outperforms human interpreters in inferring strategic intent (i.e., goals and constraints) from language (p < 0.05). Furthermore, we show that our model (125M parameters) significantly outperforms ChatGPT for this task (p < 0.05) in a low-data setting.