AICEETLGFeb 7, 2025

Bridging the Gap in XAI-Why Reliable Metrics Matter for Explainability and Compliance

arXiv:2502.04695v28 citationsh-index: 4
Originality Synthesis-oriented
AI Analysis

This work addresses the need for reliable explainability metrics to support private actors like auditors and insurers in high-stakes AI governance, though it is incremental as it builds on prior XAI benchmarking.

The paper tackles the problem of fragmented and manipulable explainability metrics in AI, proposing a Governance by Metrics paradigm to standardize evaluation for private oversight and compliance, linking interpretability to model alignment and accountability.

Reliable explainability is not only a technical goal but also a cornerstone of private AI governance. As AI models enter high-stakes sectors, private actors such as auditors, insurers, certification bodies, and procurement agencies require standardized evaluation metrics to assess trustworthiness. However, current XAI evaluation metrics remain fragmented and prone to manipulation, which undermines accountability and compliance. We argue that standardized metrics can function as governance primitives, embedding auditability and accountability within AI systems for effective private oversight. Building upon prior work in XAI benchmarking, we identify key limitations in ensuring faithfulness, tamper resistance, and regulatory alignment. Furthermore, interpretability can directly support model alignment by providing a verifiable means of ensuring behavioral integrity in General Purpose AI (GPAI) systems. This connection between interpretability and alignment positions XAI metrics as both technical and regulatory instruments that help prevent alignment faking, a growing concern among oversight bodies. We propose a Governance by Metrics paradigm that treats explainability evaluation as a central mechanism of private AI governance. Our framework introduces a hierarchical model linking transparency, tamper resistance, scalability, and legal alignment, extending evaluation from model introspection toward systemic accountability. Through conceptual synthesis and alignment with governance standards, we outline a roadmap for integrating explainability metrics into continuous AI assurance pipelines that serve both private oversight and regulatory needs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes