CYAILGJul 26, 2019

How model accuracy and explanation fidelity influence user trust

arXiv:1907.12652v1136 citations
Originality Synthesis-oriented
AI Analysis

This research addresses the problem of user trust in explainable AI for engineers and users, though it is incremental as it builds on existing work in the field.

The study investigated how model accuracy and explanation fidelity affect user trust in machine learning systems, finding that accuracy is more important for trust than explainability, and that explanations can harm trust if nonsensical.

Machine learning systems have become popular in fields such as marketing, financing, or data mining. While they are highly accurate, complex machine learning systems pose challenges for engineers and users. Their inherent complexity makes it impossible to easily judge their fairness and the correctness of statistically learned relations between variables and classes. Explainable AI aims to solve this challenge by modelling explanations alongside with the classifiers, potentially improving user trust and acceptance. However, users should not be fooled by persuasive, yet untruthful explanations. We therefore conduct a user study in which we investigate the effects of model accuracy and explanation fidelity, i.e. how truthfully the explanation represents the underlying model, on user trust. Our findings show that accuracy is more important for user trust than explainability. Adding an explanation for a classification result can potentially harm trust, e.g. when adding nonsensical explanations. We also found that users cannot be tricked by high-fidelity explanations into having trust for a bad classifier. Furthermore, we found a mismatch between observed (implicit) and self-reported (explicit) trust.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes