CVAILGSep 26, 2022

Greybox XAI: a Neural-Symbolic learning framework to produce interpretable predictions for image classification

arXiv:2209.14974v132 citationsh-index: 51
Originality Incremental advance
AI Analysis

This addresses the need for trustworthy AI in critical applications by providing interpretable predictions, though it is incremental as it builds on existing XAI methods.

The authors tackled the problem of deep neural networks lacking interpretability by proposing Greybox XAI, a neural-symbolic framework that combines a DNN with a transparent model using a symbolic knowledge base, resulting in accurate and explainable predictions across multiple datasets.

Although Deep Neural Networks (DNNs) have great generalization and prediction capabilities, their functioning does not allow a detailed explanation of their behavior. Opaque deep learning models are increasingly used to make important predictions in critical environments, and the danger is that they make and use predictions that cannot be justified or legitimized. Several eXplainable Artificial Intelligence (XAI) methods that separate explanations from machine learning models have emerged, but have shortcomings in faithfulness to the model actual functioning and robustness. As a result, there is a widespread agreement on the importance of endowing Deep Learning models with explanatory capabilities so that they can themselves provide an answer to why a particular prediction was made. First, we address the problem of the lack of universal criteria for XAI by formalizing what an explanation is. We also introduced a set of axioms and definitions to clarify XAI from a mathematical perspective. Finally, we present the Greybox XAI, a framework that composes a DNN and a transparent model thanks to the use of a symbolic Knowledge Base (KB). We extract a KB from the dataset and use it to train a transparent model (i.e., a logistic regression). An encoder-decoder architecture is trained on RGB images to produce an output similar to the KB used by the transparent model. Once the two models are trained independently, they are used compositionally to form an explainable predictive model. We show how this new architecture is accurate and explainable in several datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes