LGAPJun 7, 2021

Hybrid Machine Learning Forecasts for the UEFA EURO 2020

arXiv:2106.05799v1
Originality Synthesis-oriented
AI Analysis

This work addresses forecasting accuracy for sports analysts and bettors, but it is incremental as it builds on existing methods.

The authors tackled the problem of forecasting football match outcomes by combining three statistical ranking methods with various team and socio-economic predictors in a hybrid machine learning model, applied to UEFA EURO 2020, resulting in France being favored with a 14.8% winning probability.

Three state-of-the-art statistical ranking methods for forecasting football matches are combined with several other predictors in a hybrid machine learning model. Namely an ability estimate for every team based on historic matches; an ability estimate for every team based on bookmaker consensus; average plus-minus player ratings based on their individual performances in their home clubs and national teams; and further team covariates (e.g., market value, team structure) and country-specific socio-economic factors (population, GDP). The proposed combined approach is used for learning the number of goals scored in the matches from the four previous UEFA EUROs 2004-2016 and then applied to current information to forecast the upcoming UEFA EURO 2020. Based on the resulting estimates, the tournament is simulated repeatedly and winning probabilities are obtained for all teams. A random forest model favors the current World Champion France with a winning probability of 14.8% before England (13.5%) and Spain (12.3%). Additionally, we provide survival probabilities for all teams and at all tournament stages.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes