MLLGDec 27, 2024

Causal machine learning for heterogeneous treatment effects in the presence of missing outcome data

arXiv:2412.19711v22 citationsh-index: 7Biometrics
Originality Incremental advance
AI Analysis

This addresses a commonly overlooked issue in causal machine learning for practitioners dealing with missing data in treatment effect estimation, offering incremental improvements to existing methods.

The paper tackles the problem of missing outcome data in estimating heterogeneous treatment effects, proposing two de-biased machine learning estimators (mDR-learner and mEP-learner) that integrate inverse probability of censoring weights, and shows they achieve oracle efficiency and favorable performance in simulations and a breast cancer trial application.

When estimating heterogeneous treatment effects, missing outcome data can complicate treatment effect estimation, causing certain subgroups of the population to be poorly represented. In this work, we discuss this commonly overlooked problem and consider the impact that missing at random (MAR) outcome data has on causal machine learning estimators for the conditional average treatment effect (CATE). We propose two de-biased machine learning estimators for the CATE, the mDR-learner and mEP-learner, which address the issue of under-representation by integrating inverse probability of censoring weights into the DR-learner and EP-learner respectively. We show that under reasonable conditions, these estimators are oracle efficient, and illustrate their favorable performance through simulated data settings, comparing them to existing CATE estimators, including comparison to estimators which use common missing data techniques. We present an example of their application using the GBSG2 trial, exploring treatment effect heterogeneity when comparing hormonal therapies to non-hormonal therapies among breast cancer patients post surgery, and offer guidance on the decisions a practitioner must make when implementing these estimators.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes