CVAIJan 25, 2024

Respect the model: Fine-grained and Robust Explanation with Sharing Ratio Decomposition

arXiv:2402.03348v26 citationsICLR
Originality Incremental advance
AI Analysis

This addresses the need for more truthful and robust explanations in AI for researchers and practitioners, though it appears incremental as it builds on existing XAI methods with a new vector perspective.

The paper tackles the problem of existing explanation methods not faithfully representing model decisions and being vulnerable to adversarial attacks by proposing SRD, a novel XAI method that reflects the model's inference process, resulting in significantly enhanced robustness in explanations.

The truthfulness of existing explanation methods in authentically elucidating the underlying model's decision-making process has been questioned. Existing methods have deviated from faithfully representing the model, thus susceptible to adversarial attacks. To address this, we propose a novel eXplainable AI (XAI) method called SRD (Sharing Ratio Decomposition), which sincerely reflects the model's inference process, resulting in significantly enhanced robustness in our explanations. Different from the conventional emphasis on the neuronal level, we adopt a vector perspective to consider the intricate nonlinear interactions between filters. We also introduce an interesting observation termed Activation-Pattern-Only Prediction (APOP), letting us emphasize the importance of inactive neurons and redefine relevance encapsulating all relevant information including both active and inactive neurons. Our method, SRD, allows for the recursive decomposition of a Pointwise Feature Vector (PFV), providing a high-resolution Effective Receptive Field (ERF) at any layer.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes