ITCRLGSPOct 17, 2024

Private Counterfactual Retrieval

arXiv:2410.13812v21 citationsh-index: 63
Originality Incremental advance
AI Analysis

This work addresses privacy concerns for users and institutions in high-stake applications using counterfactual explanations, though it is incremental as it builds on existing PIR techniques.

The paper tackles the privacy risks in retrieving counterfactual explanations from black-box ML models by proposing schemes inspired by private information retrieval to ensure user privacy, achieving perfect privacy for users with quantified database leakage and empirical validation on real datasets.

Transparency and explainability are two extremely important aspects to be considered when employing black-box machine learning models in high-stake applications. Providing counterfactual explanations is one way of fulfilling this requirement. However, this also poses a threat to the privacy of both the institution that is providing the explanation as well as the user who is requesting it. In this work, we propose multiple schemes inspired by private information retrieval (PIR) techniques which ensure the \emph{user's privacy} when retrieving counterfactual explanations. We present a scheme which retrieves the \emph{exact} nearest neighbor counterfactual explanation from a database of accepted points while achieving perfect (information-theoretic) privacy for the user. While the scheme achieves perfect privacy for the user, some leakage on the database is inevitable which we quantify using a mutual information based metric. Furthermore, we propose strategies to reduce this leakage to achieve an advanced degree of database privacy. We extend these schemes to incorporate user's preference on transforming their attributes, so that a more actionable explanation can be received. Since our schemes rely on finite field arithmetic, we empirically validate our schemes on real datasets to understand the trade-off between the accuracy and the finite field sizes. Finally, we present numerical results to support our theoretical findings, and compare the database leakage of the proposed schemes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes