AI · LG · Jan 18, 2025

Distributionally Robust Policy Evaluation and Learning for Continuous Treatment with Observational Data

arXiv:2501.10693v1 · 1 citation · h-index: 4 · AAAI
Originality: Incremental advance
AI Analysis

This work addresses a practical gap: existing methods either handle only discrete treatments or assume no distribution shift between the learning and deployment environments. The advance is incremental, though, because it extends known techniques to the continuous-treatment setting.

The paper studies policy evaluation and learning with continuous treatments from offline observational data under distribution shift. It builds distributionally robust estimators on an Inverse Probability Weighting (IPW) method extended with a kernel function, and supports them with finite-sample analysis and experiments.

Using offline observational data for policy evaluation and learning allows decision-makers to evaluate and learn a policy that maps individual characteristics to interventions. Most existing literature either focuses on discrete treatment spaces or assumes that the policy-learning and policy-deployment environments share the same distribution. These limitations restrict applications in many real-world scenarios where treatments are continuous and distribution shifts are present. To overcome these challenges, this paper develops a distributionally robust policy for the continuous treatment setting. The proposed distributionally robust estimators for policy evaluation and learning are built on the Inverse Probability Weighting (IPW) method, extended from discrete to continuous treatments. Specifically, we introduce a kernel function into the proposed IPW estimator to mitigate the exclusion of observations that can occur when the standard IPW method is applied to continuous treatments. We then provide a finite-sample analysis that guarantees the convergence of the proposed distributionally robust policy evaluation and learning estimators. Comprehensive experiments further verify the effectiveness of our approach when distribution shifts are present.
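To make the kernel idea concrete, here is a minimal Python sketch of a kernel-smoothed IPW value estimate and a worst-case version of it. This is an illustration under stated assumptions, not the paper's implementation: the abstract does not specify the kernel, the bandwidth, or the uncertainty set, so the Gaussian kernel, the self-normalized weighting, the KL-divergence ball of radius `delta`, and every function name below are hypothetical choices. With a continuous treatment, the logged treatment almost never equals the policy's recommendation exactly, so an indicator-based IPW weight would discard nearly every observation; the kernel instead gives nonzero weight to observations whose treatment is close to the recommendation.

```python
import numpy as np

def gaussian_kernel(u):
    """Standard normal density, used here as the smoothing kernel (an assumption)."""
    return np.exp(-0.5 * u**2) / np.sqrt(2.0 * np.pi)

def kernel_ipw_weights(x, t, policy, propensity_density, bandwidth):
    """Per-observation weights K((pi(x_i) - t_i) / h) / (h * f(t_i | x_i))."""
    u = (policy(x) - t) / bandwidth
    return gaussian_kernel(u) / (bandwidth * propensity_density(t, x))

def kernel_ipw_value(y, weights, normalize=True):
    """Weighted average outcome; normalize=True gives the self-normalized form."""
    if normalize:
        return float(np.sum(weights * y) / np.sum(weights))
    return float(np.mean(weights * y))

def kl_robust_value(y, weights, delta, alphas=None):
    """Worst-case weighted mean of y over a KL ball of radius delta.

    Assumes a KL uncertainty set and uses the standard dual form
    sup_{a > 0} { -a * log E_w[exp(-y / a)] - a * delta },
    approximated by a maximum over a finite grid of a values.
    """
    if alphas is None:
        alphas = np.logspace(-2, 3, 400)
    p = weights / np.sum(weights)                 # normalized weights
    z = -y[None, :] / alphas[:, None]             # shape (len(alphas), n)
    zmax = z.max(axis=1, keepdims=True)           # stabilizes the log-sum-exp
    log_mgf = zmax[:, 0] + np.log(np.sum(p[None, :] * np.exp(z - zmax), axis=1))
    return float(np.max(-alphas * log_mgf - alphas * delta))

# Illustrative synthetic data (not from the paper).
rng = np.random.default_rng(0)
n = 2000
x = rng.normal(size=n)
t = x + rng.normal(size=n)                        # logged treatment, f(t | x) = N(x, 1)
y = -(t - x) ** 2 + rng.normal(scale=0.1, size=n)

policy = lambda x: x                              # hypothetical policy to evaluate
propensity_density = lambda t, x: gaussian_kernel(t - x)

w = kernel_ipw_weights(x, t, policy, propensity_density, bandwidth=0.3)
print(kernel_ipw_value(y, w), kl_robust_value(y, w, delta=0.1))
```

The robust value never exceeds the nominal one, and the gap widens as `delta` grows. The bandwidth trades bias for variance: a small value approaches the indicator weighting and keeps few observations, while a large value blurs together treatments that differ substantially from the policy's recommendation.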

