CVJul 15, 2022

Diverse Human Motion Prediction via Gumbel-Softmax Sampling from an Auxiliary Space

arXiv:2207.07351v159 citationsh-index: 30Has Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of predicting multiple plausible future motions in computer vision, which is important for applications like animation and robotics, though it is incremental as it builds on existing deep generative models.

The paper tackles the problem of generating diverse human motion predictions from observed poses by proposing a novel sampling strategy that uses an auxiliary space and Gumbel-Softmax sampling to improve diversity and accuracy, with experiments showing significant improvements over previous state-of-the-art methods.

Diverse human motion prediction aims at predicting multiple possible future pose sequences from a sequence of observed poses. Previous approaches usually employ deep generative networks to model the conditional distribution of data, and then randomly sample outcomes from the distribution. While different results can be obtained, they are usually the most likely ones which are not diverse enough. Recent work explicitly learns multiple modes of the conditional distribution via a deterministic network, which however can only cover a fixed number of modes within a limited range. In this paper, we propose a novel sampling strategy for sampling very diverse results from an imbalanced multimodal distribution learned by a deep generative model. Our method works by generating an auxiliary space and smartly making randomly sampling from the auxiliary space equivalent to the diverse sampling from the target distribution. We propose a simple yet effective network architecture that implements this novel sampling strategy, which incorporates a Gumbel-Softmax coefficient matrix sampling method and an aggressive diversity promoting hinge loss function. Extensive experiments demonstrate that our method significantly improves both the diversity and accuracy of the samplings compared with previous state-of-the-art sampling approaches. Code and pre-trained models are available at https://github.com/Droliven/diverse_sampling.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes