LGAIOct 14, 2025

A Multimodal XAI Framework for Trustworthy CNNs and Bias Detection in Deep Representation Learning

arXiv:2510.12957v1h-index: 1
Originality Incremental advance
AI Analysis

This work addresses the need for trustworthy AI in sensitive domains by enhancing transparency and fairness, though it is incremental as it builds on existing XAI methods.

The paper tackled the problem of latent biases and limited trustworthiness in deep neural networks for high-stakes applications by proposing a multimodal Explainable AI framework, achieving 93.2% classification accuracy, 91.6% F1-score, and 78.1% explanation fidelity on multimodal MNIST extensions.

Standard benchmark datasets, such as MNIST, often fail to expose latent biases and multimodal feature complexities, limiting the trustworthiness of deep neural networks in high-stakes applications. We propose a novel multimodal Explainable AI (XAI) framework that unifies attention-augmented feature fusion, Grad-CAM++-based local explanations, and a Reveal-to-Revise feedback loop for bias detection and mitigation. Evaluated on multimodal extensions of MNIST, our approach achieves 93.2% classification accuracy, 91.6% F1-score, and 78.1% explanation fidelity (IoU-XAI), outperforming unimodal and non-explainable baselines. Ablation studies demonstrate that integrating interpretability with bias-aware learning enhances robustness and human alignment. Our work bridges the gap between performance, transparency, and fairness, highlighting a practical pathway for trustworthy AI in sensitive domains.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes