Covariate Balancing and Riesz Regression Should Be Guided by the Neyman Orthogonal Score in Debiased Machine Learning

arXiv:2605.0638634.2
Predicted impact top 66% in EM · last 90 daysOriginality Synthesis-oriented
AI Analysis

This position paper clarifies the theoretical foundation for balancing in debiased machine learning, providing guidance for practitioners on when covariate balancing is appropriate versus when regressor balancing is needed.

This paper argues that in debiased machine learning, balancing functions should be derived from the Neyman orthogonal score, not chosen only as functions of covariates. For ATE estimation under treatment effect heterogeneity, covariate balancing can leave treatment-specific components unbalanced, so regressor balancing using Riesz regression with basis functions of the full regressor is advocated as the general principle.

This position paper argues that, in debiased machine learning, balancing functions should be derived from the Neyman orthogonal score, not chosen only as functions of covariates. Covariate balancing is effective when the regression error entering the score can be represented by functions of covariates alone, and it is the natural finite-dimensional approximation for targets such as ATT counterfactual means. For ATE estimation under treatment effect heterogeneity, however, the score error generally contains treatment-specific components because the outcome regression is a function of the full regressor $X=(D,Z)$. In that case, balancing common functions of $Z$ can leave the treatment-specific component unbalanced. We therefore advocate regressor balancing, implemented by Riesz regression with basis functions of $X$, as the general balancing principle for DML. The position is not that covariate balancing is invalid, but that covariate balancing should be understood as the special case that is appropriate when the score-relevant regression error is a function of covariates alone.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes