APLGMLSep 22, 2024

Causal Inference with Double/Debiased Machine Learning for Evaluating the Health Effects of Multiple Mismeasured Pollutants

arXiv:2410.07135v12 citationsh-index: 45
Originality Incremental advance
AI Analysis

This work addresses a critical issue in epidemiology for researchers and policymakers by providing a more accurate method to assess health impacts of air pollution, though it is incremental as it builds on existing double/debiased machine learning techniques.

The paper tackled the problem of estimating causal health effects of multiple air pollutants when exposure measurements are inaccurate and correlated, by extending a double/debiased machine learning approach with regression calibration to correct for measurement error. The method reduced bias in simulations and identified two PM2.5 constituents, Br and Mn, as having negative causal effects on cognitive function in a real-world study.

One way to quantify exposure to air pollution and its constituents in epidemiologic studies is to use an individual's nearest monitor. This strategy results in potential inaccuracy in the actual personal exposure, introducing bias in estimating the health effects of air pollution and its constituents, especially when evaluating the causal effects of correlated multi-pollutant constituents measured with correlated error. This paper addresses estimation and inference for the causal effect of one constituent in the presence of other PM2.5 constituents, accounting for measurement error and correlations. We used a linear regression calibration model, fitted with generalized estimating equations in an external validation study, and extended a double/debiased machine learning (DML) approach to correct for measurement error and estimate the effect of interest in the main study. We demonstrated that the DML estimator with regression calibration is consistent and derived its asymptotic variance. Simulations showed that the proposed estimator reduced bias and attained nominal coverage probability across most simulation settings. We applied this method to assess the causal effects of PM2.5 constituents on cognitive function in the Nurses' Health Study and identified two PM2.5 constituents, Br and Mn, that showed a negative causal effect on cognitive function after measurement error correction.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes