MoGlow: Probabilistic and controllable motion synthesis using normalising flows
This addresses the need for efficient, real-time motion generation in applications like animation, games, and social robotics, though it is incremental as it builds on existing normalising flow techniques.
The paper tackled the problem of probabilistic and controllable motion synthesis by introducing a new class of models based on normalising flows, achieving motion quality close to recorded motion capture in evaluations on human and quadruped locomotion datasets.
Data-driven modelling and synthesis of motion is an active research area with applications that include animation, games, and social robotics. This paper introduces a new class of probabilistic, generative, and controllable motion-data models based on normalising flows. Models of this kind can describe highly complex distributions, yet can be trained efficiently using exact maximum likelihood, unlike GANs or VAEs. Our proposed model is autoregressive and uses LSTMs to enable arbitrarily long time-dependencies. Importantly, is is also causal, meaning that each pose in the output sequence is generated without access to poses or control inputs from future time steps; this absence of algorithmic latency is important for interactive applications with real-time motion control. The approach can in principle be applied to any type of motion since it does not make restrictive, task-specific assumptions regarding the motion or the character morphology. We evaluate the models on motion-capture datasets of human and quadruped locomotion. Objective and subjective results show that randomly-sampled motion from the proposed method outperforms task-agnostic baselines and attains a motion quality close to recorded motion capture.