CVDec 1, 2020

A compact sequence encoding scheme for online human activity recognition in HRI applications

arXiv:2012.00873v1
AI Analysis

This work addresses the problem of deploying human activity recognition on resource-constrained robotic hardware for future household robotic assistants, which is an incremental improvement for HRI applications.

This paper proposes a compact sequence encoding scheme for online human activity recognition, transforming spatio-temporal action sequences into compact representations using Mahalanobis distance-based shape features and the Radon transform. This allows for robust end-to-end online action recognition deployable on hardware without extreme computing capabilities.

Human activity recognition and analysis has always been one of the most active areas of pattern recognition and machine intelligence, with applications in various fields, including but not limited to exertion games, surveillance, sports analytics and healthcare. Especially in Human-Robot Interaction, human activity understanding plays a crucial role as household robotic assistants are a trend of the near future. However, state-of-the-art infrastructures that can support complex machine intelligence tasks are not always available, and may not be for the average consumer, as robotic hardware is expensive. In this paper we propose a novel action sequence encoding scheme which efficiently transforms spatio-temporal action sequences into compact representations, using Mahalanobis distance-based shape features and the Radon transform. This representation can be used as input for a lightweight convolutional neural network. Experiments show that the proposed pipeline, when based on state-of-the-art human pose estimation techniques, can provide a robust end-to-end online action recognition scheme, deployable on hardware lacking extreme computing capabilities.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes