SPCYLGNov 18, 2020

Self-supervised transfer learning of physiological representations from free-living wearable data

arXiv:2011.12121v10.0052 citations
AI Analysis65

This research addresses the problem of inferring high-level health outcomes from low-level wearable signals for large-scale health and lifestyle monitoring, offering an incremental improvement by proposing a novel self-supervised approach.

This paper introduces a self-supervised representation learning method that uses activity and heart rate (HR) signals from free-living wearable data to create physiological representations. The method leverages the physiological relationship between activity and HR, using HR responses as a supervisory signal for activity data, and employs a custom quantile loss function for the long-tailed HR distribution. The resulting embeddings can forecast HR accurately and generalize to various downstream tasks, outperforming unsupervised autoencoders and common biomarkers in predicting health, fitness, and demographic characteristics.

Wearable devices such as smartwatches are becoming increasingly popular tools for objectively monitoring physical activity in free-living conditions. To date, research has primarily focused on the purely supervised task of human activity recognition, demonstrating limited success in inferring high-level health outcomes from low-level signals. Here, we present a novel self-supervised representation learning method using activity and heart rate (HR) signals without semantic labels. With a deep neural network, we set HR responses as the supervisory signal for the activity data, leveraging their underlying physiological relationship. In addition, we propose a custom quantile loss function that accounts for the long-tailed HR distribution present in the general population. We evaluate our model in the largest free-living combined-sensing dataset (comprising >280k hours of wrist accelerometer & wearable ECG data). Our contributions are two-fold: i) the pre-training task creates a model that can accurately forecast HR based only on cheap activity sensors, and ii) we leverage the information captured through this task by proposing a simple method to aggregate the learnt latent representations (embeddings) from the window-level to user-level. Notably, we show that the embeddings can generalize in various downstream tasks through transfer learning with linear classifiers, capturing physiologically meaningful, personalized information. For instance, they can be used to predict variables associated with individuals' health, fitness and demographic characteristics, outperforming unsupervised autoencoders and common bio-markers. Overall, we propose the first multimodal self-supervised method for behavioral and physiological data with implications for large-scale health and lifestyle monitoring.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes