CYAIMay 15, 2023

Certification Labels for Trustworthy AI: Insights From an Empirical Mixed-Method Study

arXiv:2305.18307v140 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of making AI audits accessible to the public, offering practical insights for designing trustworthy AI systems, though it is incremental in building on existing audit frameworks.

The study tackled the problem of communicating AI audit trustworthiness to end-users by empirically investigating certification labels, finding that labels significantly increase trust and willingness to use AI in both low- and high-stakes scenarios, with effects more pronounced in high-stakes cases.

Auditing plays a pivotal role in the development of trustworthy AI. However, current research primarily focuses on creating auditable AI documentation, which is intended for regulators and experts rather than end-users affected by AI decisions. How to communicate to members of the public that an AI has been audited and considered trustworthy remains an open challenge. This study empirically investigated certification labels as a promising solution. Through interviews (N = 12) and a census-representative survey (N = 302), we investigated end-users' attitudes toward certification labels and their effectiveness in communicating trustworthiness in low- and high-stakes AI scenarios. Based on the survey results, we demonstrate that labels can significantly increase end-users' trust and willingness to use AI in both low- and high-stakes scenarios. However, end-users' preferences for certification labels and their effect on trust and willingness to use AI were more pronounced in high-stake scenarios. Qualitative content analysis of the interviews revealed opportunities and limitations of certification labels, as well as facilitators and inhibitors for the effective use of labels in the context of AI. For example, while certification labels can mitigate data-related concerns expressed by end-users (e.g., privacy and data protection), other concerns (e.g., model performance) are more challenging to address. Our study provides valuable insights and recommendations for designing and implementing certification labels as a promising constituent within the trustworthy AI ecosystem.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes