ROMar 22

Dreaming the Unseen: World Model-regularized Diffusion Policy for Out-of-Distribution Robustness

arXiv:2603.2101781.9h-index: 25
AI Analysis

This addresses robustness issues in robotics and autonomous systems for real-world deployment, representing a strong specific gain rather than a broad paradigm shift.

The paper tackles the problem of diffusion policies failing under severe out-of-distribution disturbances in visuomotor control by introducing the Dream Diffusion Policy, which integrates a diffusion world model to enable robust state-prediction and achieves a 73.8% success rate on MetaWorld and 83.3% under real-world spatial shifts.

Diffusion policies excel at visuomotor control but often fail catastrophically under severe out-of-distribution (OOD) disturbances, such as unexpected object displacements or visual corruptions. To address this vulnerability, we introduce the Dream Diffusion Policy (DDP), a framework that deeply integrates a diffusion world model into the policy's training objective via a shared 3D visual encoder. This co-optimization endows the policy with robust state-prediction capabilities. When encountering sudden OOD anomalies during inference, DDP detects the real-imagination discrepancy and actively abandons the corrupted visual stream. Instead, it relies on its internal "imagination" (autoregressively forecasted latent dynamics) to safely bypass the disruption, generating imagined trajectories before smoothly realigning with physical reality. Extensive evaluations demonstrate DDP's exceptional resilience. Notably, DDP achieves a 73.8% OOD success rate on MetaWorld (vs. 23.9% without predictive imagination) and an 83.3% success rate under severe real-world spatial shifts (vs. 3.3% without predictive imagination). Furthermore, as a stress test, DDP maintains a 76.7% real-world success rate even when relying entirely on open-loop imagination post-initialization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes