SECYLGJul 1, 2024

FairLay-ML: Intuitive Debugging of Fairness in Data-Driven Social-Critical Software

arXiv:2407.01423v23 citationsh-index: 9Has Code
Originality Synthesis-oriented
AI Analysis

It addresses fairness issues for developers and users of social-critical software, but is incremental as it builds on existing debugging and fairness testing methods.

The paper tackles the problem of debugging fairness in data-driven software by introducing FairLay-ML, a tool that visualizes datasets, models, and decisions, and includes counterfactual fairness testing, with studies measuring false positives/negatives and human perception of test cases.

Data-driven software solutions have significantly been used in critical domains with significant socio-economic, legal, and ethical implications. The rapid adoptions of data-driven solutions, however, pose major threats to the trustworthiness of automated decision-support software. A diminished understanding of the solution by the developer and historical/current biases in the data sets are primary challenges. To aid data-driven software developers and end-users, we present FairLay-ML, a debugging tool to test and explain the fairness implications of data-driven solutions. FairLay-ML visualizes the logic of datasets, trained models, and decisions for a given data point. In addition, it trains various models with varying fairness-accuracy trade-offs. Crucially, FairLay-ML incorporates counterfactual fairness testing that finds bugs beyond the development datasets. We conducted two studies through FairLay-ML that allowed us to measure false positives/negatives in prevalent counterfactual testing and understand the human perception of counterfactual test cases in a class survey. FairLay-ML and its benchmarks are publicly available at https://github.com/Pennswood/FairLay-ML. The live version of the tool is available at https://fairlayml-v2.streamlit.app/. We provide a video demo of the tool at https://youtu.be/wNI9UWkywVU?t=133.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes