ROLGLOSep 1, 2025

Constrained Decoding for Robotics Foundation Models

arXiv:2509.01728v32 citationsh-index: 5
Originality Incremental advance
AI Analysis

This addresses safety concerns for robotic systems using foundation models, though it is incremental as it builds on existing models without modifying their training.

The paper tackles the lack of explicit safety guarantees in robotic foundation models by introducing SafeDec, a constrained decoding framework that enforces safety specifications using Signal Temporal Logic, resulting in provably safe actions without retraining and improved performance on tasks from the CHORES benchmark.

Recent advances in the development of robotic foundation models have led to promising end-to-end and general-purpose capabilities in robotic systems. Trained on vast datasets of simulated and real-world trajectories, these models map multimodal observations directly to action sequences for physical execution. Despite promising real-world capabilities, these models are still data-driven and, therefore, lack explicit notions of behavioral correctness. We address this gap by introducing SafeDec, a constrained decoding framework for autoregressive, robot foundation models that enforces invariant safety specifications on candidate action trajectories. Task-specific safety rules are expressed as Signal Temporal Logic (STL) formulas and are enforced at inference time with minimal overhead. Our method ensures that generated actions provably satisfy STL specifications under assumed dynamics at runtime without retraining , while remaining agnostic of the underlying policy. We evaluate SafeDec on tasks from the CHORES benchmark for state-of-the-art generalist policies (e.g., SPOC, Flare, PoliFormer) across hundreds of procedurally generated environments and show that our decoding-time interventions are useful not only for filtering unsafe actions but also for conditional action generation. Videos are available at constrained-robot-fms.github.io.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes