CVSep 15, 2021

Pose Transformers (POTR): Human Motion Prediction with Non-Autoregressive Transformers

Angel Martínez-González, Michael Villamizar, Jean-Marc Odobez

arXiv:2109.07531v120.294 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses human motion prediction for applications like robotics and animation, but it is incremental as it adapts existing Transformer methods to a non-autoregressive approach.

The authors tackled human motion prediction by proposing a non-autoregressive Transformer model that decodes sequences in parallel, achieving competitive results on two public datasets, particularly for short-term predictions.

We propose to leverage Transformer architectures for non-autoregressive human motion prediction. Our approach decodes elements in parallel from a query sequence, instead of conditioning on previous predictions such as instate-of-the-art RNN-based approaches. In such a way our approach is less computational intensive and potentially avoids error accumulation to long term elements in the sequence. In that context, our contributions are fourfold: (i) we frame human motion prediction as a sequence-to-sequence problem and propose a non-autoregressive Transformer to infer the sequences of poses in parallel; (ii) we propose to decode sequences of 3D poses from a query sequence generated in advance with elements from the input sequence;(iii) we propose to perform skeleton-based activity classification from the encoder memory, in the hope that identifying the activity can improve predictions;(iv) we show that despite its simplicity, our approach achieves competitive results in two public datasets, although surprisingly more for short term predictions rather than for long term ones.

View on arXiv PDF Code

Similar