CYAIMay 14, 2025

Healthy Distrust in AI systems

arXiv:2505.09747v11 citationsh-index: 4
Originality Synthesis-oriented
AI Analysis

This work addresses the need for a more nuanced understanding of trust and distrust in AI to protect human autonomy, though it is incremental as it builds on existing interdisciplinary research.

The paper tackles the problem of overemphasizing trust in AI systems by arguing that justified distrust is necessary for meaningful trust, especially when AI is used in ways that conflict with human interests. It introduces the concept of 'healthy distrust' as a careful stance to respect human autonomy in AI usage.

Under the slogan of trustworthy AI, much of contemporary AI research is focused on designing AI systems and usage practices that inspire human trust and, thus, enhance adoption of AI systems. However, a person affected by an AI system may not be convinced by AI system design alone -- neither should they, if the AI system is embedded in a social context that gives good reason to believe that it is used in tension with a person's interest. In such cases, distrust in the system may be justified and necessary to build meaningful trust in the first place. We propose the term "healthy distrust" to describe such a justified, careful stance towards certain AI usage practices. We investigate prior notions of trust and distrust in computer science, sociology, history, psychology, and philosophy, outline a remaining gap that healthy distrust might fill and conceptualize healthy distrust as a crucial part for AI usage that respects human autonomy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes