CVApr 30, 2024

MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model

Tsinghua
arXiv:2404.19759v3164 citationsh-index: 10ECCV
Originality Incremental advance
AI Analysis

This work addresses runtime inefficiency for researchers and practitioners in AI-driven animation or robotics, though it is incremental as it builds on existing motion latent diffusion models.

The paper tackles the problem of slow runtime in text-conditioned controllable motion generation by proposing MotionLCM, which extends latent consistency models to achieve real-time generation with one-step or few-step inference. The result is a method that generates human motions with text and control signals in real-time, as demonstrated by experimental results showing remarkable capabilities and efficiency.

This work introduces MotionLCM, extending controllable motion generation to a real-time level. Existing methods for spatial-temporal control in text-conditioned motion generation suffer from significant runtime inefficiency. To address this issue, we first propose the motion latent consistency model (MotionLCM) for motion generation, building on the motion latent diffusion model. By adopting one-step (or few-step) inference, we further improve the runtime efficiency of the motion latent diffusion model for motion generation. To ensure effective controllability, we incorporate a motion ControlNet within the latent space of MotionLCM and enable explicit control signals (i.e., initial motions) in the vanilla motion space to further provide supervision for the training process. By employing these techniques, our approach can generate human motions with text and control signals in real-time. Experimental results demonstrate the remarkable generation and controlling capabilities of MotionLCM while maintaining real-time runtime efficiency.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes