ML | LG | Mar 14, 2024

Pantypes: Diverse Representatives for Self-Explainable Models

arXiv:2403.09383v1 | 6 citations | AAAI
Originality: Incremental advance
AI Analysis

The paper addresses representation bias for users of interpretable AI systems, offering an incremental improvement over existing prototypical models.

The paper tackles the problem of representation bias in prototypical self-explainable classifiers, where learned prototypes fail to capture low-density regions of the input distribution, and introduces pantypes to improve diversity, interpretability, and fairness.

Prototypical self-explainable classifiers have emerged to meet the growing demand for interpretable AI systems. These classifiers are designed to incorporate high transparency in their decisions by basing inference on similarity with learned prototypical objects. While these models are designed with diversity in mind, the learned prototypes often do not sufficiently represent all aspects of the input distribution, particularly those in low density regions. Such lack of sufficient data representation, known as representation bias, has been associated with various detrimental properties related to machine learning diversity and fairness. In light of this, we introduce pantypes, a new family of prototypical objects designed to capture the full diversity of the input distribution through a sparse set of objects. We show that pantypes can empower prototypical self-explainable models by occupying divergent regions of the latent space and thus fostering high diversity, interpretability and fairness.
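
To make the idea of "inference on similarity with learned prototypical objects" concrete, here is a minimal sketch of generic nearest-prototype classification in a latent space, together with a simple spread measure over the prototypes. It is an illustration under stated assumptions, not the paper's implementation: the function names, the squared Euclidean similarity, the toy 2-D latent vectors, and the mean pairwise distance used as a diversity proxy are all hypothetical choices made for this example.

```python
import math


def squared_distance(a, b):
    """Squared Euclidean distance between two latent vectors of equal length."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def classify(z, prototypes, prototype_labels):
    """Assign z the label of its nearest prototype.

    Returns (label, prototype_index), so the chosen prototype can be shown
    to the user as the explanation for the decision.
    """
    distances = [squared_distance(z, p) for p in prototypes]
    best = min(range(len(prototypes)), key=distances.__getitem__)
    return prototype_labels[best], best


def mean_pairwise_distance(prototypes):
    """Illustrative diversity proxy (an assumption, not the paper's metric):
    the average Euclidean distance over all distinct prototype pairs."""
    pairs = [
        (i, j)
        for i in range(len(prototypes))
        for j in range(i + 1, len(prototypes))
    ]
    total = sum(
        math.sqrt(squared_distance(prototypes[i], prototypes[j]))
        for i, j in pairs
    )
    return total / len(pairs)


# Toy latent space: two prototypes for class "a", one for class "b".
prototypes = [[0.0, 0.0], [4.0, 0.0], [0.0, 3.0]]
prototype_labels = ["a", "a", "b"]

label, index = classify([3.5, 0.2], prototypes, prototype_labels)
spread = mean_pairwise_distance(prototypes)
```

In this toy setup, prototypes that sit close together would yield a small spread and leave low-density regions without a nearby representative, which is the failure mode the abstract calls representation bias.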

Code Implementations: 1 repo
