EM LG MLDec 17, 2024

Dual Interpretation of Machine Learning Forecasts

Philippe Goulet Coulombe, Maximilian Goebel, Karin Klieber

arXiv:2412.13076v14 citationsh-index: 8

Originality Incremental advance

AI Analysis

This provides a new interpretability tool for analysts in macroeconomic forecasting, though it is incremental as it builds on existing methods.

The paper tackles the problem of interpreting machine learning forecasts by proposing a dual interpretation method that expresses predictions as linear combinations of historical analogies, and applies it to macroeconomic forecasting to analyze inflation, GDP growth, and recession probabilities, demonstrating how models leverage historical patterns.

Machine learning predictions are typically interpreted as the sum of contributions of predictors. Yet, each out-of-sample prediction can also be expressed as a linear combination of in-sample values of the predicted variable, with weights corresponding to pairwise proximity scores between current and past economic events. While this dual route leads nowhere in some contexts (e.g., large cross-sectional datasets), it provides sparser interpretations in settings with many regressors and little training data-like macroeconomic forecasting. In this case, the sequence of contributions can be visualized as a time series, allowing analysts to explain predictions as quantifiable combinations of historical analogies. Moreover, the weights can be viewed as those of a data portfolio, inspiring new diagnostic measures such as forecast concentration, short position, and turnover. We show how weights can be retrieved seamlessly for (kernel) ridge regression, random forest, boosted trees, and neural networks. Then, we apply these tools to analyze post-pandemic forecasts of inflation, GDP growth, and recession probabilities. In all cases, the approach opens the black box from a new angle and demonstrates how machine learning models leverage history partly repeating itself.

View on arXiv PDF

Similar