LGDec 11, 2025

DCFO: Density-Based Counterfactuals for Outliers - Additional Material

arXiv:2512.10659v2
Originality Highly original
AI Analysis

This work addresses the interpretability gap for a widely used outlier detection algorithm, providing actionable insights for practitioners in fields relying on LOF.

The paper tackles the problem of explaining outliers detected by the Local Outlier Factor (LOF) method by introducing DCFO, a novel density-based counterfactual explanation method that partitions the data space for efficient optimization, and it demonstrates superior performance in proximity and validity on 50 OpenML datasets.

Outlier detection identifies data points that significantly deviate from the majority of the data distribution. Explaining outliers is crucial for understanding the underlying factors that contribute to their detection, validating their significance, and identifying potential biases or errors. Effective explanations provide actionable insights, facilitating preventive measures to avoid similar outliers in the future. Counterfactual explanations clarify why specific data points are classified as outliers by identifying minimal changes required to alter their prediction. Although valuable, most existing counterfactual explanation methods overlook the unique challenges posed by outlier detection, and fail to target classical, widely adopted outlier detection algorithms. Local Outlier Factor (LOF) is one the most popular unsupervised outlier detection methods, quantifying outlierness through relative local density. Despite LOF's widespread use across diverse applications, it lacks interpretability. To address this limitation, we introduce Density-based Counterfactuals for Outliers (DCFO), a novel method specifically designed to generate counterfactual explanations for LOF. DCFO partitions the data space into regions where LOF behaves smoothly, enabling efficient gradient-based optimisation. Extensive experimental validation on 50 OpenML datasets demonstrates that DCFO consistently outperforms benchmarked competitors, offering superior proximity and validity of generated counterfactuals.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes