AICRLGDec 31, 2024

Extending XReason: Formal Explanations for Adversarial Detection

arXiv:2501.00537v13 citationsh-index: 30
Originality Synthesis-oriented
AI Analysis

This work addresses the need for formal guarantees in explainable AI for cybersecurity, but it is incremental as it builds on an existing tool.

The paper tackles the problem of providing formal explanations for machine learning models by extending the XReason tool to support LightGBM models and class-level explanations, and implementing adversarial example generation and detection, evaluating it on the CICIDS-2017 dataset for network attack detection.

Explainable Artificial Intelligence (XAI) plays an important role in improving the transparency and reliability of complex machine learning models, especially in critical domains such as cybersecurity. Despite the prevalence of heuristic interpretation methods such as SHAP and LIME, these techniques often lack formal guarantees and may produce inconsistent local explanations. To fulfill this need, few tools have emerged that use formal methods to provide formal explanations. Among these, XReason uses a SAT solver to generate formal instance-level explanation for XGBoost models. In this paper, we extend the XReason tool to support LightGBM models as well as class-level explanations. Additionally, we implement a mechanism to generate and detect adversarial examples in XReason. We evaluate the efficiency and accuracy of our approach on the CICIDS-2017 dataset, a widely used benchmark for detecting network attacks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes