CVApr 8, 2019

What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment

arXiv:1904.04346v2212 citationsHas Code
AI Analysis

This addresses the challenge of accurately assessing performance quality in sports like diving, offering a more robust method than single-task learning, though it is incremental in improving existing AQA techniques.

The paper tackled the problem of action quality assessment (AQA) by proposing a multitask learning approach that jointly learns features for fine-grained action recognition, commentary generation, and AQA scoring, achieving a new state-of-the-art rank correlation of 90.44% on a new diving dataset.

Can performance on the task of action quality assessment (AQA) be improved by exploiting a description of the action and its quality? Current AQA and skills assessment approaches propose to learn features that serve only one task - estimating the final score. In this paper, we propose to learn spatio-temporal features that explain three related tasks - fine-grained action recognition, commentary generation, and estimating the AQA score. A new multitask-AQA dataset, the largest to date, comprising of 1412 diving samples was collected to evaluate our approach (https://github.com/ParitoshParmar/MTL-AQA). We show that our MTL approach outperforms STL approach using two different kinds of architectures: C3D-AVG and MSCADC. The C3D-AVG-MTL approach achieves the new state-of-the-art performance with a rank correlation of 90.44%. Detailed experiments were performed to show that MTL offers better generalization than STL, and representations from action recognition models are not sufficient for the AQA task and instead should be learned.

Code Implementations5 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes