ROAIOct 24, 2025

Two-Steps Diffusion Policy for Robotic Manipulation via Genetic Denoising

arXiv:2510.21991v15 citationsh-index: 4
Originality Incremental advance
AI Analysis

This addresses the problem of slow inference in diffusion policies for robotic manipulation, offering a domain-specific improvement.

The paper tackled the inefficiency of diffusion models in robotic manipulation by tailoring the denoising process to action distributions, achieving effective operation with as few as 5 neural function evaluations and up to 20% performance gains with fewer steps.

Diffusion models, such as diffusion policy, have achieved state-of-the-art results in robotic manipulation by imitating expert demonstrations. While diffusion models were originally developed for vision tasks like image and video generation, many of their inference strategies have been directly transferred to control domains without adaptation. In this work, we show that by tailoring the denoising process to the specific characteristics of embodied AI tasks -- particularly structured, low-dimensional nature of action distributions -- diffusion policies can operate effectively with as few as 5 neural function evaluations (NFE). Building on this insight, we propose a population-based sampling strategy, genetic denoising, which enhances both performance and stability by selecting denoising trajectories with low out-of-distribution risk. Our method solves challenging tasks with only 2 NFE while improving or matching performance. We evaluate our approach across 14 robotic manipulation tasks from D4RL and Robomimic, spanning multiple action horizons and inference budgets. In over 2 million evaluations, our method consistently outperforms standard diffusion-based policies, achieving up to 20\% performance gains with significantly fewer inference steps.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes