ROJun 2

OMP: One-step Meanflow Policy with Directional Alignment

Han Fang, Yize Huang, Yuheng Zhao, Paul Weng, Xiao Li, Yutong Ban

arXiv:2512.1934793.34 citations

Predicted impact top 8% in RO · last 90 daysOriginality Highly original

AI Analysis

Enables real-time, high-fidelity robot manipulation for practitioners needing fast and accurate generative policies.

OMP achieves state-of-the-art success rates and trajectory accuracy on Adroit and Meta-World benchmarks while enabling single-step inference, overcoming spectral bias and gradient starvation in low-velocity regimes.

Robot manipulation has increasingly adopted data-driven generative policy frameworks, yet the field faces a persistent trade-off: diffusion models suffer from high inference latency, while flow-based methods often require complex architectural constraints. Although in image generation domain, the MeanFlow paradigm offers a path to single-step inference, its direct application to robotics is impeded by critical theoretical pathologies, specifically spectral bias and gradient starvation in low-velocity regimes. To overcome these limitations, we propose the One-step MeanFlow Policy (OMP), a novel framework designed for high-fidelity, real-time manipulation. We introduce a lightweight directional alignment mechanism to explicitly synchronize predicted velocities with true mean velocities. Furthermore, we implement a Differential Derivation Equation (DDE) to approximate the Jacobian-Vector Product (JVP) operator, which decouples forward and backward passes to significantly reduce memory complexity. Extensive experiments on the Adroit and Meta-World benchmarks demonstrate that OMP outperforms state-of-the-art methods in success rate and trajectory accuracy, particularly in high-precision tasks, while retaining the efficiency of single-step generation.

View on arXiv PDF

Similar