ROApr 20

UniDomain: Pretraining a Unified PDDL Domain from Real-World Demonstrations for Generalizable Robot Task Planning

Haoming Ye, Yunxiao Xiao, Cewu Lu, Panpan Cai

arXiv:2507.2154572.33 citationsh-index: 4

Predicted impact top 19% in RO · last 90 daysOriginality Incremental advance

AI Analysis

For robotic task planning, it addresses the bottleneck of handcrafted or narrow symbolic domains by automatically constructing a large-scale, generalizable domain from demonstrations.

UniDomain pre-trains a unified PDDL domain from 12,393 robot manipulation videos, enabling zero-shot task planning on unseen real-world tasks with up to 58% higher success and 160% better plan optimality than LLM baselines.

Robotic task planning in real-world environments requires reasoning over implicit constraints from language and vision. While LLMs and VLMs offer strong priors, they struggle with long-horizon structure and symbolic grounding. Existing methods that combine LLMs with symbolic planning often rely on handcrafted or narrow domains, limiting generalization. We propose UniDomain, a framework that pre-trains a PDDL domain from robot manipulation demonstrations and applies it for online robotic task planning. It extracts atomic domains from 12,393 manipulation videos to form a unified domain with 3137 operators, 2875 predicates, and 16481 causal edges. Given a target class of tasks, it retrieves relevant atomics from the unified domain and systematically fuses them into high-quality meta-domains to support compositional generalization in planning. Experiments on diverse real-world tasks show that UniDomain solves complex, unseen tasks in a zero-shot manner, achieving up to 58% higher task success and 160% improvement in plan optimality over state-of-the-art LLM and LLM-PDDL baselines.

View on arXiv PDF

Similar