ML, CR, LG, EM | Feb 22, 2022

Differentially Private Estimation of Heterogeneous Causal Effects

Fengshi Niu, Harsha Nori, Brian Quistorff, Rich Caruana, Donald Ngwe, Aadharsh Kannan

arXiv:2202.11043v1 | 13.122 citations | Has Code

Originality: Incremental advance

AI Analysis

This work addresses privacy concerns in causal inference for domains such as healthcare, but the advance is incremental: it adds privacy guarantees to existing estimators.

The paper tackles the problem of estimating heterogeneous causal effects from sensitive data by introducing a differentially private meta-algorithm. Its experiments show that multi-stage estimators incur a larger accuracy loss than single-stage ones, and that most of this loss comes from increased variance rather than bias.

Estimating heterogeneous treatment effects in domains such as healthcare or social science often involves sensitive data where protecting privacy is important. We introduce a general meta-algorithm for estimating conditional average treatment effects (CATE) with differential privacy (DP) guarantees. Our meta-algorithm can work with simple, single-stage CATE estimators such as S-learner and more complex multi-stage estimators such as DR and R-learner. We perform a tight privacy analysis by taking advantage of sample splitting in our meta-algorithm and the parallel composition property of differential privacy. In this paper, we implement our approach using DP-EBMs as the base learner. DP-EBMs are interpretable, high-accuracy models with privacy guarantees, which allow us to directly observe the impact of DP noise on the learned causal model. Our experiments show that multi-stage CATE estimators incur larger accuracy loss than single-stage CATE or ATE estimators and that most of the accuracy loss from differential privacy is due to an increase in variance, not biased estimates of treatment effects.
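
The role of sample splitting in the privacy analysis can be illustrated with a small budget calculation. The sketch below is not the paper's accounting; it assumes pure epsilon-DP with basic composition, and the function names and the per-stage budget of 1.0 are hypothetical. If every stage of a multi-stage estimator is trained on the same records, the per-stage budgets add up (sequential composition). If each stage is trained on its own disjoint fold, each record influences only one private training step, so the overall cost is the largest single-stage budget (parallel composition).

```python
from typing import Sequence


def sequential_epsilon(epsilons: Sequence[float]) -> float:
    """Basic sequential composition: mechanisms run on the same records add up."""
    return float(sum(epsilons))


def parallel_epsilon(epsilons: Sequence[float]) -> float:
    """Parallel composition: mechanisms run on disjoint record subsets cost the max."""
    return float(max(epsilons))


# Hypothetical two-stage estimator: stage 1 fits nuisance models,
# stage 2 fits the final CATE model. Each stage is assumed to spend epsilon = 1.0.
stage_eps = [1.0, 1.0]

same_data_cost = sequential_epsilon(stage_eps)   # both stages reuse all records
disjoint_fold_cost = parallel_epsilon(stage_eps)  # each stage gets its own fold

print(f"no sample splitting:   epsilon = {same_data_cost}")
print(f"with sample splitting: epsilon = {disjoint_fold_cost}")
```

Under these assumptions, splitting lets each stage spend the full budget rather than a fraction of it, at the price of training each stage on fewer records.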
