Accuracy, Fairness, and Interpretability of Machine Learning Criminal Recidivism Models
This addresses concerns about the reliability and ethical use of recidivism models in parole decisions, though it is incremental as it compares existing methods on a new dataset.
The paper tackled the problem of evaluating machine learning models for criminal recidivism prediction, finding that there are trade-offs between accuracy, fairness, and interpretability, with no single model performing best across all criteria.
Criminal recidivism models are tools that have gained widespread adoption by parole boards across the United States to assist with parole decisions. These models take in large amounts of data about an individual and then predict whether an individual would commit a crime if released on parole. Although such models are not the only or primary factor in making the final parole decision, questions have been raised about their accuracy, fairness, and interpretability. In this paper, various machine learning-based criminal recidivism models are created based on a real-world parole decision dataset from the state of Georgia in the United States. The recidivism models are comparatively evaluated for their accuracy, fairness, and interpretability. It is found that there are noted differences and trade-offs between accuracy, fairness, and being inherently interpretable. Therefore, choosing the best model depends on the desired balance between accuracy, fairness, and interpretability, as no model is perfect or consistently the best across different criteria.