CV · Dec 2, 2022

Video-based Pose-Estimation Data as Source for Transfer Learning in Human Activity Recognition

arXiv:2212.01353v1 · 2 citations · h-index: 11
Originality: Synthesis-oriented
AI Analysis

This work addresses data scarcity in HAR for wearable-device applications, but it is incremental: it adapts existing transfer learning methods to a new data source.

The paper tackles the scarcity of annotated on-body device data for Human Activity Recognition (HAR) by using video-based pose-estimation datasets as a source for transfer learning, resulting in improved performance on three on-body device datasets.

Human Activity Recognition (HAR) using on-body devices identifies specific human actions in unconstrained environments. HAR is challenging because of the inter- and intra-variance of human movements; moreover, annotated datasets from on-body devices are scarce. This scarcity stems mainly from the difficulty of data creation, i.e., recording, expensive annotation, and the lack of standard definitions of human activities. Previous work has shown that transfer learning is a good strategy for scenarios with scarce data. However, the scarcity of annotated on-body device datasets remains. This paper proposes using datasets intended for human-pose estimation as a source for transfer learning; specifically, it uses sequences of annotated pixel coordinates of human joints from video datasets for HAR and human-pose estimation. A deep architecture is pre-trained on four benchmark video-based source datasets. Finally, an evaluation on three on-body device datasets shows improved HAR performance.
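
The abstract's central idea is that sequences of joint pixel coordinates can stand in for on-body sensor recordings as a multichannel time series. The following is a minimal sketch of that idea, not the paper's preprocessing: it assumes each frame's (x, y) joint coordinates are flattened into channels and the sequence is cut into fixed-length sliding windows. The function names, window length, stride, and toy data are all hypothetical.

```python
from typing import List, Sequence, Tuple

# Assumption: one (x, y) pixel coordinate per joint, per video frame.
Frame = Sequence[Tuple[float, float]]


def frames_to_channels(frames: Sequence[Frame]) -> List[List[float]]:
    """Flatten each frame's joints into one row: [x0, y0, x1, y1, ...]."""
    return [[coord for joint in frame for coord in joint] for frame in frames]


def sliding_windows(
    series: List[List[float]], length: int, stride: int
) -> List[List[List[float]]]:
    """Cut a [time, channels] series into fixed-length, possibly overlapping windows."""
    if length <= 0 or stride <= 0:
        raise ValueError("length and stride must be positive")
    return [
        series[start:start + length]
        for start in range(0, len(series) - length + 1, stride)
    ]


# Toy sequence: 10 frames with 3 joints each (values are made up for illustration).
frames = [[(float(t), float(t + j)) for j in range(3)] for t in range(10)]
series = frames_to_channels(frames)          # 10 time steps x 6 channels
windows = sliding_windows(series, length=4, stride=2)
```

Windows in this [time, channels] layout have the same shape as segmented accelerometer or gyroscope recordings, which is what would let a network pre-trained on pose data be fine-tuned on on-body device data under these assumptions.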

