CVMay 8

Towards Highly-Constrained Human Motion Generation with Retrieval-Guided Diffusion Noise Optimization

arXiv:2605.0805491.0
Predicted impact top 14% in CV · last 90 daysOriginality Incremental advance
AI Analysis

For virtual agents and character animation, this method addresses a critical bottleneck in generating motions under extreme spatiotemporal constraints, though it is an incremental improvement over training-free diffusion optimization.

This work tackles highly-constrained human motion generation, such as severe spatial obstacles or specified step counts, which existing methods fail on. The proposed retrieval-guided diffusion noise optimization with relational task parsing solves these tasks, enabling automatic reasoning for what to retrieve.

Generating human motion that satisfies customized zero-shot goal functions, enabling applications such as controllable character animation and behavior synthesis for virtual agents, is a critical capability. While current approaches handle many unseen constraints, they fail on tasks with very challenging spatiotemporal restrictions, such as severe spatial obstacles or specified numbers of walking steps. To equip motion generators for these highly constrained tasks, we present a retrieval-guided method built on the training-free diffusion noise optimization framework. The key idea is to search within large motion datasets for guidance that can potentially satisfy difficult constraints. We introduce relational task parsing to group target constraints and identify the difficult ones to be handled by retrieved reference. A better initialization for diffusion noise is then obtained via a reward-guided mask that combines random noise with retrieved noise. By optimizing diffusion noise from this improved initialization, we successfully solve highly constrained generation tasks. By leveraging LLM for relational task parsing, the whole framework is further enabled to automatically reason for what to retrieve, improving the intelligence of moving agents under a training-free optimization scheme.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes