Beyond Differences: Doubly Robust Meta-Learners for Ratio-Based Treatment Effects

arXiv:2605.262885.1h-index: 2

Predicted impact top 89% in ML · last 90 daysOriginality Incremental advance

AI Analysis

For practitioners needing robust ratio-based CATE estimation, especially in confounded observational settings, this work provides a new default method with strong empirical performance.

The paper introduces the Q-Learner for ratio-based conditional average treatment effects (CATE), which decomposes the ratio into two odds ratios, and proposes doubly robust augmentations. In benchmarks, the Q-Learner excels in low-conversion regimes, while the DR learners outperform on observational data.

When treatment effects are naturally expressed as ratios -- as in medicine, pricing, and marketing -- the ratio-based CATE $τ(x) = E[Y|W=1,X=x] / E[Y|W=0,X=x]$ is the appropriate estimand. Yet existing estimators either impose a log-linear parametric structure or apply generic regression without robustness guarantees for this functional. We introduce the Q-Learner, which decomposes $τ(x)$ into a product of two odds ratios, reducing ratio-CATE estimation for binary outcomes to two propensity classification tasks. We further derive doubly robust augmentations for both S/T- and Q-style ratio learners and characterize their distinct robustness properties. In benchmarks on seven RCT datasets, the Q-Learner is the most consistently competitive method in low-conversion regimes, where its propensity-only construction sidesteps the imbalanced regression that hurts outcome-based estimators. On four observational datasets, where propensity must be estimated and confounding cannot be ruled out, the DR learners introduced here decisively come out on top, making them practitioners' natural default for confounded observational data.

View on arXiv PDF

Similar