AIAug 19, 2025

ITL-LIME: Instance-Based Transfer Learning for Enhancing Local Explanations in Low-Resource Data Settings

Rehan Raza, Guanjin Wang, Kok Wai Wong, Hamid Laga, Marco Fisichella

arXiv:2508.13672v23.32 citationsh-index: 7CIKM

Originality Incremental advance

AI Analysis

This addresses the challenge of generating reliable local explanations for black-box models in low-resource data settings, which is an incremental advancement in XAI methods.

The paper tackles the problem of instability and inaccuracy in LIME explanations due to randomness and data scarcity by proposing ITL-LIME, which uses instance-based transfer learning from a source domain to enhance explanation fidelity and stability, achieving improvements in metrics like fidelity and stability scores.

Explainable Artificial Intelligence (XAI) methods, such as Local Interpretable Model-Agnostic Explanations (LIME), have advanced the interpretability of black-box machine learning models by approximating their behavior locally using interpretable surrogate models. However, LIME's inherent randomness in perturbation and sampling can lead to locality and instability issues, especially in scenarios with limited training data. In such cases, data scarcity can result in the generation of unrealistic variations and samples that deviate from the true data manifold. Consequently, the surrogate model may fail to accurately approximate the complex decision boundary of the original model. To address these challenges, we propose a novel Instance-based Transfer Learning LIME framework (ITL-LIME) that enhances explanation fidelity and stability in data-constrained environments. ITL-LIME introduces instance transfer learning into the LIME framework by leveraging relevant real instances from a related source domain to aid the explanation process in the target domain. Specifically, we employ clustering to partition the source domain into clusters with representative prototypes. Instead of generating random perturbations, our method retrieves pertinent real source instances from the source cluster whose prototype is most similar to the target instance. These are then combined with the target instance's neighboring real instances. To define a compact locality, we further construct a contrastive learning-based encoder as a weighting mechanism to assign weights to the instances from the combined set based on their proximity to the target instance. Finally, these weighted source and target instances are used to train the surrogate model for explanation purposes.

View on arXiv PDF

Similar