Explainable AI by BAPC -- Before and After correction Parameter Comparison
This work addresses the need for interpretability in AI for users requiring transparent decision-making, but it appears incremental as it builds on existing surrogate explanation methods.
The paper tackles the problem of explaining AI predictions by introducing a local surrogate method that corrects a simpler linear base model, showing that under certain noise conditions, the method achieves neighborhoods with optimal accuracy and fidelity.
A local surrogate for an AI-model correcting a simpler 'base' model is introduced representing an analytical method to yield explanations of AI-predictions. The approach is studied here in the context of the base model being linear regression. The AI-model approximates the residual error of the linear model and the explanations are formulated in terms of the change of the interpretable base model's parameters. Criteria are formulated for the precise relation between lost accuracy of the surrogate, the accuracy of the AI-model, and the surrogate fidelity. It is shown that, assuming a certain maximal amount of noise in the observed data, these criteria induce neighborhoods of the instances to be explained which have an ideal size in terms of maximal accuracy and fidelity.