CYLGFeb 3, 2025

Auditing a Dutch Public Sector Risk Profiling Algorithm Using an Unsupervised Bias Detection Tool

arXiv:2502.01713v21 citationsh-index: 5Has Code
Originality Synthesis-oriented
AI Analysis

This work addresses bias auditing in high-stakes governmental decision-making, providing a practical tool and guidelines for external auditors under privacy constraints, though it is incremental in applying existing methods to a new real-world context.

The paper tackled bias detection in a Dutch public sector algorithm used for risk profiling college students when demographic data is unavailable, using an unsupervised clustering tool to highlight disparities between students with non-European migration backgrounds and Dutch origin, covering over 250,000 students.

Algorithms are increasingly used to automate or aid human decisions, yet recent research shows that these algorithms may exhibit bias across legally protected demographic groups. However, data on these groups may be unavailable to organizations or external auditors due to privacy legislation. This paper studies bias detection using an unsupervised clustering tool when data on demographic groups are unavailable. We collaborate with the Dutch Executive Agency for Education to audit an algorithm that was used to assign risk scores to college students at the national level in the Netherlands between 2012-2023. Our audit covers more than 250,000 students from the whole country. The unsupervised clustering tool highlights known disparities between students with a non-European migration background and Dutch origin. Our contributions are three-fold: (1) we assess bias in a real-world, large-scale and high-stakes decision-making process by a governmental organization; (2) we use simulation studies to highlight potential pitfalls of using the unsupervised clustering tool to detect true bias when demographic group data are unavailable and provide recommendations for valid inferences; (3) we provide the unsupervised clustering tool in an open-source library. Our work serves as a starting point for a deliberative assessment by human experts to evaluate potential discrimination in algorithmic-supported decision-making processes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes