LGMLNov 18, 2019

Bayesian Recurrent Framework for Missing Data Imputation and Prediction with Clinical Time Series

arXiv:1911.07572v21 citations
AI Analysis

This work addresses missing data uncertainty in clinical time series for healthcare applications, offering a novel method that improves reliability assessments.

The paper tackles the problem of missing data in clinical time series by introducing a Bayesian recurrent framework for simultaneous imputation and prediction, demonstrating strong performance gains over state-of-the-art methods on mortality prediction tasks using MIMIC-III and PhysioNet datasets.

Real-world clinical time series data sets exhibit a high prevalence of missing values. Hence, there is an increasing interest in missing data imputation. Traditional statistical approaches impose constraints on the data-generating process and decouple imputation from prediction. Recent works propose recurrent neural network based approaches for missing data imputation and prediction with time series data. However, they generate deterministic outputs and neglect the inherent uncertainty. In this work, we introduce a unified Bayesian recurrent framework for simultaneous imputation and prediction on time series data sets. We evaluate our approach on two real-world mortality prediction tasks using the MIMIC-III and PhysioNet benchmark datasets. We demonstrate strong performance gains over state-of-the-art (SOTA) methods, and provide strategies to use the resulting probability distributions to better assess reliability of the imputations and predictions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes