LGMLFeb 8, 2024

Latent variable model for high-dimensional point process with structured missingness

arXiv:2402.05758v21 citationsh-index: 51ICML
AI Analysis

This work addresses a practical challenge in fields like healthcare and seismology by handling multiple data limitations simultaneously, though it appears incremental as it builds on existing methods like variational autoencoders.

The authors tackled the problem of modeling high-dimensional longitudinal data with structured missingness and unknown stochastic measurement times by proposing a latent-variable model using Gaussian processes and variational autoencoders, achieving competitive performance on simulated and real datasets.

Longitudinal data are important in numerous fields, such as healthcare, sociology and seismology, but real-world datasets present notable challenges for practitioners because they can be high-dimensional, contain structured missingness patterns, and measurement time points can be governed by an unknown stochastic process. While various solutions have been suggested, the majority of them have been designed to account for only one of these challenges. In this work, we propose a flexible and efficient latent-variable model that is capable of addressing all these limitations. Our approach utilizes Gaussian processes to capture temporal correlations between samples and their associated missingness masks as well as to model the underlying point process. We construct our model as a variational autoencoder together with deep neural network parameterised encoder and decoder models, and develop a scalable amortised variational inference approach for efficient model training. We demonstrate competitive performance using both simulated and real datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes