RO AIJun 8, 2025

Prime the search: Using large language models for guiding geometric task and motion planning by warm-starting tree search

Dongryung Lee, Sejune Joo, Kimin Lee, Beomjoon Kim

arXiv:2506.07062v17.86 citationsh-index: 6Has Codeint j robot res

Originality Incremental advance

AI Analysis

This addresses the computational inefficiency in G-TAMP for robotics by leveraging LLMs' common sense, though it is an incremental improvement over existing hybrid methods.

The paper tackles the problem of guiding geometric task and motion planning (G-TAMP) by using large language models (LLMs) to warm-start Monte Carlo Tree Search, showing that this method outperforms previous LLM planners and pure search algorithms on six different G-TAMP problems.

The problem of relocating a set of objects to designated areas amidst movable obstacles can be framed as a Geometric Task and Motion Planning (G-TAMP) problem, a subclass of task and motion planning (TAMP). Traditional approaches to G-TAMP have relied either on domain-independent heuristics or on learning from planning experience to guide the search, both of which typically demand significant computational resources or data. In contrast, humans often use common sense to intuitively decide which objects to manipulate in G-TAMP problems. Inspired by this, we propose leveraging Large Language Models (LLMs), which have common sense knowledge acquired from internet-scale data, to guide task planning in G-TAMP problems. To enable LLMs to perform geometric reasoning, we design a predicate-based prompt that encodes geometric information derived from a motion planning algorithm. We then query the LLM to generate a task plan, which is then used to search for a feasible set of continuous parameters. Since LLMs are prone to mistakes, instead of committing to LLM's outputs, we extend Monte Carlo Tree Search (MCTS) to a hybrid action space and use the LLM to guide the search. Unlike the previous approach that calls an LLM at every node and incurs high computational costs, we use it to warm-start the MCTS with the nodes explored in completing the LLM's task plan. On six different G-TAMP problems, we show our method outperforms previous LLM planners and pure search algorithms. Code can be found at: https://github.com/iMSquared/prime-the-search

View on arXiv PDF Code

Similar