AI CLAug 8, 2024

SCENE: Evaluating Explainable AI Techniques Using Soft Counterfactuals

arXiv:2408.04575v22.31 citationsh-index: 4

Originality Incremental advance

AI Analysis

This addresses the need for standardized evaluation of XAI methods in NLP, though it is incremental as it builds on existing counterfactual approaches.

The paper tackles the problem of evaluating explainable AI (XAI) methods in NLP, which are often unstable and misleading, by introducing SCENE, a novel evaluation method that uses large language models to generate soft counterfactual explanations, achieving insights into XAI techniques across different architectures.

Explainable Artificial Intelligence (XAI) plays a crucial role in enhancing the transparency and accountability of AI models, particularly in natural language processing (NLP) tasks. However, popular XAI methods such as LIME and SHAP have been found to be unstable and potentially misleading, underscoring the need for a standardized evaluation approach. This paper introduces SCENE (Soft Counterfactual Evaluation for Natural language Explainability), a novel evaluation method that leverages large language models (LLMs) to generate Soft Counterfactual explanations in a zero-shot manner. By focusing on token-based substitutions, SCENE creates contextually appropriate and semantically meaningful Soft Counterfactuals without extensive fine-tuning. SCENE adopts Validitysoft and Csoft metrics to assess the effectiveness of model-agnostic XAI methods in text classification tasks. Applied to CNN, RNN, and Transformer architectures, SCENE provides valuable insights into the strengths and limitations of various XAI techniques.

View on arXiv PDF

Similar