HCAICYLGApr 29, 2025

In defence of post-hoc explanations in medical AI

arXiv:2504.20741v11 citationsh-index: 10The Hastings center report
Originality Synthesis-oriented
AI Analysis

This addresses the problem of trust and safety in black box medical AI systems for clinicians and patients, but it is incremental as it builds on existing debates without introducing new methods.

The paper defends the value of post-hoc explanations in medical AI against critiques that they do not replicate black box reasoning, arguing they can improve user understanding, increase clinician-AI team accuracy, and assist in decision justification.

Since the early days of the Explainable AI movement, post-hoc explanations have been praised for their potential to improve user understanding, promote trust, and reduce patient safety risks in black box medical AI systems. Recently, however, critics have argued that the benefits of post-hoc explanations are greatly exaggerated since they merely approximate, rather than replicate, the actual reasoning processes that black box systems take to arrive at their outputs. In this article, we aim to defend the value of post-hoc explanations against this recent critique. We argue that even if post-hoc explanations do not replicate the exact reasoning processes of black box systems, they can still improve users' functional understanding of black box systems, increase the accuracy of clinician-AI teams, and assist clinicians in justifying their AI-informed decisions. While post-hoc explanations are not a "silver bullet" solution to the black box problem in medical AI, we conclude that they remain a useful strategy for addressing the black box problem in medical AI.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes