ROMay 11

Data-Asymmetric Latent Imagination and Reranking for 3D Robotic Imitation Learning

arXiv:2605.1016674.2
Predicted impact top 21% in RO · last 90 daysOriginality Incremental advance
AI Analysis

For robotic imitation learning practitioners, this method enables effective use of suboptimal demonstrations, reducing the need for high-quality data.

DALI-R improves 3D robotic imitation learning from mixed-quality trajectories by using a latent world model and reranking, achieving an average 6.8% higher success rate on Adroit and MetaWorld benchmarks with less than 0.7× inference overhead.

Robotic imitation learning typically assumes access to optimal demonstrations, yet real-world data collection often yields suboptimal, exploratory, or even failed trajectories. Discarding such data wastes valuable information about environment dynamics and failure modes, which can instead be leveraged to improve decision-making. While 3D policies reduce reliance on high-quality demonstrations through strong spatial generalization, they still require large-scale data to achieve high task success. To address this, we propose DALI-R, a Data-Asymmetric Latent Imagination and Reranking framework for 3D robotic imitation learning from mixed-quality trajectories. It learns a Latent World Model over 3D point clouds for imagined rollouts and a Task Completion Scorer that reranks candidate action chunks, improving decision-making without additional high-quality demonstrations. We instantiate DALI-R with both diffusion and efficient flow-matching policies and evaluate it on Adroit and MetaWorld benchmarks. Across the two evaluated 3D base policies, DALI-R achieves an average $6.8$\% improvement in success rate while incurring less than $0.7\times$ additional inference overhead.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes