LGAINESep 19, 2024

Counterfactual Explanations for Clustering Models

arXiv:2409.12632v12 citationsh-index: 21
Originality Incremental advance
AI Analysis

This addresses the challenge of fostering trust and adoption in clustering algorithms for users, especially non-experts, by providing interpretable explanations, though it is incremental as it builds on existing supervised learning techniques.

The paper tackles the problem of explaining clustering models, which are difficult to comprehend and lack explainability techniques, by proposing a model-agnostic method using counterfactual statements with a novel soft-scoring approach. The result shows that this method significantly improves explanation quality, as evaluated on five datasets and two clustering algorithms.

Clustering algorithms rely on complex optimisation processes that may be difficult to comprehend, especially for individuals who lack technical expertise. While many explainable artificial intelligence techniques exist for supervised machine learning, unsupervised learning -- and clustering in particular -- has been largely neglected. To complicate matters further, the notion of a ``true'' cluster is inherently challenging to define. These facets of unsupervised learning and its explainability make it difficult to foster trust in such methods and curtail their adoption. To address these challenges, we propose a new, model-agnostic technique for explaining clustering algorithms with counterfactual statements. Our approach relies on a novel soft-scoring method that captures the spatial information utilised by clustering models. It builds upon a state-of-the-art Bayesian counterfactual generator for supervised learning to deliver high-quality explanations. We evaluate its performance on five datasets and two clustering algorithms, and demonstrate that introducing soft scores to guide counterfactual search significantly improves the results.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes