ROAIMay 20

Jointly Learning Predicates and Actions Enables Zero-Shot Skill Composition

arXiv:2605.2064862.1
AI Analysis

This work addresses the problem of skill composition in robot learning from demonstration, offering a method to generalize to new skill combinations without retraining.

The paper introduces Predicate Action Skills (PACTS), a class of visuomotor policies that jointly model action trajectories and symbolic predicate outcomes, enabling zero-shot composition of learned skills without retraining. Results show improved action generation and predicate classification, with successful zero-shot skill sequencing demonstrated in robotic tasks.

Learning from Demonstration (LfD) enables robots to learn complex behaviors from expert examples, yet existing approaches often fail to generalize to new compositions of known skills without retraining. Modern generative policies model distributions over action trajectories alone, thus are unable to reason about the symbolic outcomes required for robust composition. We propose that skills should jointly model action trajectories and the symbolic outcomes they induce. To address this gap, we introduce Predicate Action Skills (PACTS), a class of closed-loop visuomotor policies that model skills as a joint generative process over action and predicate belief trajectories, producing coherent action-outcome rollouts within a single model. Jointly generating actions and predicates enables PACTS to learn internal representations that improve both action generation and predicate classification. Furthermore, we demonstrate zero-shot composition of learned skills via planning by leveraging online predicate predictions from PACTS as a symbolic interface for sequencing and monitoring execution. Project website: https://planpacts.github.io/

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes