CVLGNov 30, 2021

In-Bed Human Pose Estimation from Unseen and Privacy-Preserving Image Domains

arXiv:2111.15124v218 citations
Originality Incremental advance
AI Analysis

This work addresses patient monitoring in medical settings by enabling pose estimation from unseen and privacy-preserving image domains, though it is incremental as it builds on existing self-supervised and multi-modal methods.

The paper tackled in-bed human pose estimation from privacy-preserving thermal infrared images by proposing a multi-modal conditional variational autoencoder with HRNet, achieving results comparable to baseline models that require multiple modalities at inference.

Medical applications have benefited greatly from the rapid advancement in computer vision. Considering patient monitoring in particular, in-bed human posture estimation offers important health-related metrics with potential value in medical condition assessments. Despite great progress in this domain, it remains challenging due to substantial ambiguity during occlusions, and the lack of large corpora of manually labeled data for model training, particularly with domains such as thermal infrared imaging which are privacy-preserving, and thus of great interest. Motivated by the effectiveness of self-supervised methods in learning features directly from data, we propose a multi-modal conditional variational autoencoder (MC-VAE) capable of reconstructing features from missing modalities seen during training. This approach is used with HRNet to enable single modality inference for in-bed pose estimation. Through extensive evaluations, we demonstrate that body positions can be effectively recognized from the available modality, achieving on par results with baseline models that are highly dependent on having access to multiple modes at inference time. The proposed framework supports future research towards self-supervised learning that generates a robust model from a single source, and expects it to generalize over many unknown distributions in clinical environments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes