ROAIMay 11

Guided Streaming Stochastic Interpolant Policy

arXiv:2605.1005173.4
Predicted impact top 22% in RO · last 90 daysOriginality Highly original
AI Analysis

For robot control, it enables fast, reactive inference-time guidance for dynamic objectives without retraining, addressing a key limitation of existing chunk-based methods.

The paper derives optimal guidance for Stochastic Interpolants via the Backward Kolmogorov Equation and applies it to real-time robot control with the Streaming Stochastic Interpolant Policy (SSIP), achieving superior reactivity and physically valid guidance in dynamic environments compared to chunk-based policies.

Inference-time guidance is essential for steering generative robot policies toward dynamic objectives without retraining, yet existing methods are largely confined to chunk-based architectures that exhibit high latency and lack the reactivity needed for test-time preference alignment or obstacle avoidance. In this work, we formally derive the optimal guidance term for Stochastic Interpolants (SI) by analyzing the value function's time evolution via the Backward Kolmogorov Equation, establishing a modified drift that theoretically guarantees sampling from a target distribution. We apply this framework to real-time control through the Streaming Stochastic Interpolant Policy (SSIP), which generalizes the deterministic Streaming Flow Policy (SFP). Unifying this guidance law with the streaming architecture enables fast and reactive control. To support diverse deployment needs, we propose two complementary mechanisms: training-free Stochastic Trajectory Ensemble Guidance (STEG) that computes gradients on-the-fly for zero-shot adaptation, and training-based Conditional Critic Guidance (CCG) for amortized inference. Empirical evaluations demonstrate that our guided streaming approach significantly outperforms conventional chunk-based policies in reactivity and provides superior, physically valid guidance for dynamic, unstructured environments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes