EMLGMLDec 17, 2024

Dual Interpretation of Machine Learning Forecasts

arXiv:2412.13076v14 citationsh-index: 8
Originality Incremental advance
AI Analysis

This provides a new interpretability tool for analysts in macroeconomic forecasting, though it is incremental as it builds on existing methods.

The paper tackles the problem of interpreting machine learning forecasts by proposing a dual interpretation method that expresses predictions as linear combinations of historical analogies, and applies it to macroeconomic forecasting to analyze inflation, GDP growth, and recession probabilities, demonstrating how models leverage historical patterns.

Machine learning predictions are typically interpreted as the sum of contributions of predictors. Yet, each out-of-sample prediction can also be expressed as a linear combination of in-sample values of the predicted variable, with weights corresponding to pairwise proximity scores between current and past economic events. While this dual route leads nowhere in some contexts (e.g., large cross-sectional datasets), it provides sparser interpretations in settings with many regressors and little training data-like macroeconomic forecasting. In this case, the sequence of contributions can be visualized as a time series, allowing analysts to explain predictions as quantifiable combinations of historical analogies. Moreover, the weights can be viewed as those of a data portfolio, inspiring new diagnostic measures such as forecast concentration, short position, and turnover. We show how weights can be retrieved seamlessly for (kernel) ridge regression, random forest, boosted trees, and neural networks. Then, we apply these tools to analyze post-pandemic forecasts of inflation, GDP growth, and recession probabilities. In all cases, the approach opens the black box from a new angle and demonstrates how machine learning models leverage history partly repeating itself.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes