MEAIMay 17, 2023

The Impact of Missing Data on Causal Discovery: A Multicentric Clinical Study

arXiv:2305.10050v28 citations
Originality Incremental advance
AI Analysis

This addresses the issue of missing data in causal inference for clinical decision-making, though it is incremental as it builds on existing methods.

The study tackled the problem of missing data affecting causal discovery in clinical observational studies by extending state-of-the-art algorithms to incorporate expert knowledge, resulting in a clinically-relevant causal graph validated by physicians.

Causal inference for testing clinical hypotheses from observational data presents many difficulties because the underlying data-generating model and the associated causal graph are not usually available. Furthermore, observational data may contain missing values, which impact the recovery of the causal graph by causal discovery algorithms: a crucial issue often ignored in clinical studies. In this work, we use data from a multi-centric study on endometrial cancer to analyze the impact of different missingness mechanisms on the recovered causal graph. This is achieved by extending state-of-the-art causal discovery algorithms to exploit expert knowledge without sacrificing theoretical soundness. We validate the recovered graph with expert physicians, showing that our approach finds clinically-relevant solutions. Finally, we discuss the goodness of fit of our graph and its consistency from a clinical decision-making perspective using graphical separation to validate causal pathways.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes