AI HC ROJun 27, 2025

Bootstrapping Human-Like Planning via LLMs

David Porfirio, Vincent Hsiao, Morgan Fine-Morris, Leslie Smith, Laura M. Hiatt

arXiv:2506.22604v15.82 citationsh-index: 8RO-MAN

Originality Incremental advance

AI Analysis

This work addresses the need for accessible robot programming for end users, though it is incremental in combining existing paradigms.

The paper tackled the problem of combining natural language and drag-and-drop interfaces for robot task specification by developing an LLM-based pipeline that generates human-like action sequences from natural language input, finding that larger models outperform smaller ones but smaller models still achieve satisfactory performance.

Robot end users increasingly require accessible means of specifying tasks for robots to perform. Two common end-user programming paradigms include drag-and-drop interfaces and natural language programming. Although natural language interfaces harness an intuitive form of human communication, drag-and-drop interfaces enable users to meticulously and precisely dictate the key actions of the robot's task. In this paper, we investigate the degree to which both approaches can be combined. Specifically, we construct a large language model (LLM)-based pipeline that accepts natural language as input and produces human-like action sequences as output, specified at a level of granularity that a human would produce. We then compare these generated action sequences to another dataset of hand-specified action sequences. Although our results reveal that larger models tend to outperform smaller ones in the production of human-like action sequences, smaller models nonetheless achieve satisfactory performance.

View on arXiv PDF

Similar