LGOct 13, 2021

Bond Default Prediction with Text Embeddings, Undersampling and Deep Learning

arXiv:2110.07035v1
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of reducing capital costs for local public goods by improving default prediction for municipal bonds, though it is incremental as it applies existing techniques to a new domain.

The paper tackles the problem of predicting municipal bond defaults, which are extremely rare (less than 0.2% default rate), by combining text embeddings, a neural network, and oversampling. It achieves a result of predicting 9 out of 10 defaults at issuance with false positives on less than 0.1% of non-defaulting bonds, outperforming human estimates and traditional models.

The special and important problems of default prediction for municipal bonds are addressed using a combination of text embeddings from a pre-trained transformer network, a fully connected neural network, and synthetic oversampling. The combination of these techniques provides significant improvement in performance over human estimates, linear models, and boosted ensemble models, on data with extreme imbalance. Less than 0.2% of municipal bonds default, but our technique predicts 9 out of 10 defaults at the time of issue, without using bond ratings, at a cost of false positives on less than 0.1% non-defaulting bonds. The results hold the promise of reducing the cost of capital for local public goods, which are vital for society, and bring techniques previously used in personal credit and public equities (or national fixed income), as well as the current generation of embedding techniques, to sub-sovereign credit decisions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes