ProDG: Prototypes for Data-Free Generative Post-Hoc Explainability
This work addresses the data dependency bottleneck in prototype-based post-hoc explainability, enabling interpretability in privacy-sensitive applications.
ProDG introduces a data-free post-hoc explainability framework that generates high-fidelity prototypes directly from a frozen model's weights, eliminating the need for any external data. It achieves robust visual interpretability for privacy-sensitive domains where data is inaccessible.
Ante-hoc interpretability methods based on prototypes provide highly accurate explanations by utilizing the intuitive "this looks like that" reasoning paradigm. On the other hand, post-hoc models can explain predictions for a single image without relying on an underlying dataset or requiring costly neural network retraining. Recent approaches successfully solve the retraining problem for prototype-based networks. However, they still face a fundamental limitation: they require access to a subset of data (e.g., a test or validation set) to search for and extract the visual prototypes. In this paper, we address this issue and introduce ProDG: Generative Prototypes for Data-Free Post-Hoc Explainability, a novel framework that leverages generative models to synthesize pure, high-fidelity prototypes directly from the frozen model's weights, completely eliminating the dependency on any external data. By establishing this new frontier in Data-Free XAI, ProDG unlocks robust visual interpretability for privacy-sensitive domains, where original data is strictly restricted or fundamentally inaccessible. Project page: https://github.com/piotr310100/ProDG