HC AIMay 18

Exploring Trust Calibration in XAI - The Impact of Exposing Model Limitations to Lay Users

Alfio Ventura, Tim Katzke, Jan Corazza, Mustafa Yalçıner

arXiv:2605.1803621.2

Predicted impact top 74% in HC · last 90 daysOriginality Synthesis-oriented

AI Analysis

For XAI practitioners and researchers, this work provides evidence on how to design limitation communication to improve trust calibration, though the effect is limited to case-wise measures.

This study investigates trust calibration in XAI by exposing model limitations to lay users in a skin-lesion classification task. Results show that only limitation disclosure for case-wise measures reliably impacts trust calibration, while short-term experience does not yield progressive calibration.

Trust calibration -- aligning user trust judgment with model capability -- is crucial for safe deployment of explainable AI (XAI), yet is often evaluated via global trust ratings detached from objective performance evidence. We present a preregistered, incentivized between-subject online study (N=418 representative UK sample) on explainable skin-lesion classification that disentangles expectation-setting from experienced performance. Participants completed 15 case evaluations using a fixed XAI panel (malignancy score, reliability score, and saliency map). We systematically manipulated five experimental onboarding conditions varying example-based information and limitation disclosures with five stimulus packages naturally varying observed prediction quality. Calibration was operationalized as the deviation between trust-related judgments (TAIS and case-wise ratings) and objective performance benchmarks for the encountered cases, analysed with hierarchical mixed-effects models. Only limitation disclosure for case-wise measures reliably impacts trust calibration, and short-term experience did not yield progressive calibration. Further, the experienced package of stimuli explained substantially more variance than the experimental manipulation. However, participants were hard-pressed to differentiate between case-wise perceived trust, trustworthiness, and accuracy estimation. We discuss implications for designing limitation communication and for measuring and analysing calibration metrics in XAI evaluations. All study materials and data of this study are publicly available for replication and further academic use.

View on arXiv PDF

Similar