LGAIARIVFeb 22, 2023

Non-Uniform Interpolation in Integrated Gradients for Low-Latency Explainable-AI

arXiv:2302.11107v16 citationsh-index: 50
Originality Incremental advance
AI Analysis

This addresses the problem of real-time XAI for users needing fast explanations, but it is incremental as it optimizes an existing method.

The paper tackles the computational overhead of Integrated Gradients in Explainable-AI by proposing a non-uniform interpolation scheme, achieving a 2.6-3.6× speedup on GPU systems for iso-convergence with minimal latency overhead.

There has been a surge in Explainable-AI (XAI) methods that provide insights into the workings of Deep Neural Network (DNN) models. Integrated Gradients (IG) is a popular XAI algorithm that attributes relevance scores to input features commensurate with their contribution to the model's output. However, it requires multiple forward \& backward passes through the model. Thus, compared to a single forward-pass inference, there is a significant computational overhead to generate the explanation which hinders real-time XAI. This work addresses the aforementioned issue by accelerating IG with a hardware-aware algorithm optimization. We propose a novel non-uniform interpolation scheme to compute the IG attribution scores which replaces the baseline uniform interpolation. Our algorithm significantly reduces the total interpolation steps required without adversely impacting convergence. Experiments on the ImageNet dataset using a pre-trained InceptionV3 model demonstrate \textit{2.6-3.6}$\times$ performance speedup on GPU systems for iso-convergence. This includes the minimal \textit{0.2-3.2}\% latency overhead introduced by the pre-processing stage of computing the non-uniform interpolation step-sizes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes