Cost-optimal Sequential Testing via Doubly Robust Q-learning

arXiv:2604.111655.8h-index: 4
Predicted impact top 70% in ML · last 90 daysOriginality Incremental advance
AI Analysis

For clinicians and healthcare systems, this work provides a principled approach to optimize costly sequential testing decisions from observational data, offering potential cost savings while maintaining diagnostic accuracy.

This paper develops a doubly robust Q-learning framework for learning cost-optimal sequential testing policies from retrospective data, handling informative missingness due to test availability. The method reduces testing costs without compromising predictive accuracy, as demonstrated in a prostate cancer cohort study.

Clinical decision-making often involves selecting tests that are costly, invasive, or time-consuming, motivating individualized, sequential strategies for what to measure and when to stop ascertaining. We study the problem of learning cost-optimal sequential decision policies from retrospective data, where test availability depends on prior results, inducing informative missingness. Under a sequential missing-at-random mechanism, we develop a doubly robust Q-learning framework for estimating optimal policies. The method introduces path-specific inverse probability weights that account for heterogeneous test trajectories and satisfy a normalization property conditional on the observed history. By combining these weights with auxiliary contrast models, we construct orthogonal pseudo-outcomes that enable unbiased policy learning when either the acquisition model or the contrast model is correctly specified. We establish oracle inequalities for the stage-wise contrast estimators, along with convergence rates, regret bounds, and misclassification rates for the learned policy. Simulations demonstrate improved cost-adjusted performance over weighted and complete-case baselines, and an application to a prostate cancer cohort study illustrates how the method reduces testing cost without compromising predictive accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes