HC · AI · LG · Oct 11, 2025

Measuring What Matters: Connecting AI Ethics Evaluations to System Attributes, Hazards, and Harms

arXiv:2510.10339v1 · 5 citations · h-index: 15 · Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society
Originality: Synthesis-oriented
AI Analysis

This work addresses the fragmentation of AI ethics evaluations, a problem for both researchers and practitioners, and offers incremental insights into current evaluation practices.

The paper analyzes nearly 800 AI ethics evaluation measures from the literature and finds that they are fragmented and focus primarily on fairness, transparency, privacy, and trust, with limited coverage of interactions across system elements and of harms.

Over the past decade, an ecosystem of measures has emerged to evaluate the social and ethical implications of AI systems, largely shaped by high-level ethics principles. These measures are developed and used in fragmented ways, without adequate attention to how they are situated in AI systems. In this paper, we examine how existing measures used in the computing literature map to AI system components, attributes, hazards, and harms. Our analysis draws on a scoping review resulting in nearly 800 measures corresponding to 11 AI ethics principles. We find that most measures focus on four principles - fairness, transparency, privacy, and trust - and primarily assess model or output system components. Few measures account for interactions across system elements, and only a narrow set of hazards is typically considered for each harm type. Many measures are disconnected from where harm is experienced and lack guidance for setting meaningful thresholds. These patterns reveal how current evaluation practices remain fragmented, measuring in pieces rather than capturing how harms emerge across systems. Framing measures with respect to system attributes, hazards, and harms can strengthen regulatory oversight, support actionable practices in industry, and ground future research in systems-level understanding.