Consensus statement on the credibility assessment of ML predictors
It addresses the critical need for reliable ML predictors in clinical and biomedical contexts, providing guidance for researchers, developers, and regulators, though it is incremental as it builds on existing frameworks.
This paper tackles the problem of assessing the credibility of machine learning predictors in high-stakes healthcare by presenting a consensus statement with twelve key recommendations, focusing on causal knowledge, error quantification, and robustness to biases.
The rapid integration of machine learning (ML) predictors into in silico medicine has revolutionized the estimation of quantities of interest (QIs) that are otherwise challenging to measure directly. However, the credibility of these predictors is critical, especially when they inform high-stakes healthcare decisions. This position paper presents a consensus statement developed by experts within the In Silico World Community of Practice. We outline twelve key statements forming the theoretical foundation for evaluating the credibility of ML predictors, emphasizing the necessity of causal knowledge, rigorous error quantification, and robustness to biases. By comparing ML predictors with biophysical models, we highlight unique challenges associated with implicit causal knowledge and propose strategies to ensure reliability and applicability. Our recommendations aim to guide researchers, developers, and regulators in the rigorous assessment and deployment of ML predictors in clinical and biomedical contexts.