Debiasing Alternative Data for Credit Underwriting Using Causal Inference
This addresses the issue of discrimination in credit access for underserved groups, offering a solution to enable fairer lending practices.
The paper tackles the problem of alternative data in credit underwriting potentially acting as illegal proxies for protected classes like race or gender, causing redlining, and proposes a method using causal inference to debias such data, demonstrating improved model accuracy across racial groups with nondiscrimination guarantees.
Alternative data provides valuable insights for lenders to evaluate a borrower's creditworthiness, which could help expand credit access to underserved groups and lower costs for borrowers. But some forms of alternative data have historically been excluded from credit underwriting because it could act as an illegal proxy for a protected class like race or gender, causing redlining. We propose a method for applying causal inference to a supervised machine learning model to debias alternative data so that it might be used for credit underwriting. We demonstrate how our algorithm can be used against a public credit dataset to improve model accuracy across different racial groups, while providing theoretically robust nondiscrimination guarantees.