LGMay 24, 2025

Conformal Prediction for Uncertainty Estimation in Drug-Target Interaction Prediction

Morteza Rakhshaninejad, Mira Jurgens, Nicolas Dewolf, Willem Waegeman

arXiv:2505.18890v1h-index: 4COPA

Originality Incremental advance

AI Analysis

This work addresses uncertainty estimation for drug discovery researchers, offering incremental improvements over existing conformal prediction methods.

The paper tackled the problem of improving uncertainty estimation in drug-target interaction prediction by analyzing cluster-conditioned conformal prediction methods, finding that nonconformity-based clustering yields the tightest intervals and most reliable subgroup coverage, especially in random and fully unseen drug-protein splits.

Accurate drug-target interaction (DTI) prediction with machine learning models is essential for drug discovery. Such models should also provide a credible representation of their uncertainty, but applying classical marginal conformal prediction (CP) in DTI prediction often overlooks variability across drug and protein subgroups. In this work, we analyze three cluster-conditioned CP methods for DTI prediction, and compare them with marginal and group-conditioned CP. Clusterings are obtained via nonconformity scores, feature similarity, and nearest neighbors, respectively. Experiments on the KIBA dataset using four data-splitting strategies show that nonconformity-based clustering yields the tightest intervals and most reliable subgroup coverage, especially in random and fully unseen drug-protein splits. Group-conditioned CP works well when one entity is familiar, but residual-driven clustering provides robust uncertainty estimates even in sparse or novel scenarios. These results highlight the potential of cluster-based CP for improving DTI prediction under uncertainty.

View on arXiv PDF

Similar