LG AI CE CP RMMay 21, 2022

Deep Learning vs. Gradient Boosting: Benchmarking state-of-the-art machine learning algorithms for credit scoring

arXiv:2205.10535v19.626 citationsh-index: 5

Originality Synthesis-oriented

AI Analysis

This study provides practical guidance for financial services companies on model selection for credit risk management, though it is incremental as it compares existing methods.

The paper benchmarked deep learning and gradient boosting machines for credit scoring using three datasets, finding that GBM generally outperforms DL in accuracy and speed, making it the preferred choice for most scenarios.

Artificial intelligence (AI) and machine learning (ML) have become vital to remain competitive for financial services companies around the globe. The two models currently competing for the pole position in credit risk management are deep learning (DL) and gradient boosting machines (GBM). This paper benchmarked those two algorithms in the context of credit scoring using three distinct datasets with different features to account for the reality that model choice/power is often dependent on the underlying characteristics of the dataset. The experiment has shown that GBM tends to be more powerful than DL and has also the advantage of speed due to lower computational requirements. This makes GBM the winner and choice for credit scoring. However, it was also shown that the outperformance of GBM is not always guaranteed and ultimately the concrete problem scenario or dataset will determine the final model choice. Overall, based on this study both algorithms can be considered state-of-the-art for binary classification tasks on structured datasets, while GBM should be the go-to solution for most problem scenarios due to easier use, significantly faster training time, and superior accuracy.

View on arXiv PDF

Similar