ML LG APJan 16, 2020

Machine learning for total cloud cover prediction

Ágnes Baran, Sebastian Lerch, Mehrez El Ayari, Sándor Baran

arXiv:2001.05948v12 citations

AI Analysis

This work addresses the need for better TCC predictions in fields like astronomy and energy, but it is incremental as it applies existing machine learning methods to a known calibration problem in meteorology.

The paper tackled the problem of improving total cloud cover (TCC) ensemble forecasts, which are often uncalibrated and less skillful than other weather variables, by applying machine learning methods for post-processing; results showed that all calibration methods significantly improved forecast skill, with MLP, POLR, and GBM performing best, and incorporating precipitation forecasts led to further gains, especially with extended MLP models for most lead times.

Accurate and reliable forecasting of total cloud cover (TCC) is vital for many areas such as astronomy, energy demand and production, or agriculture. Most meteorological centres issue ensemble forecasts of TCC, however, these forecasts are often uncalibrated and exhibit worse forecast skill than ensemble forecasts of other weather variables. Hence, some form of post-processing is strongly required to improve predictive performance. As TCC observations are usually reported on a discrete scale taking just nine different values called oktas, statistical calibration of TCC ensemble forecasts can be considered a classification problem with outputs given by the probabilities of the oktas. This is a classical area where machine learning methods are applied. We investigate the performance of post-processing using multilayer perceptron (MLP) neural networks, gradient boosting machines (GBM) and random forest (RF) methods. Based on the European Centre for Medium-Range Weather Forecasts global TCC ensemble forecasts for 2002-2014 we compare these approaches with the proportional odds logistic regression (POLR) and multiclass logistic regression (MLR) models, as well as the raw TCC ensemble forecasts. We further assess whether improvements in forecast skill can be obtained by incorporating ensemble forecasts of precipitation as additional predictor. Compared to the raw ensemble, all calibration methods result in a significant improvement in forecast skill. RF models provide the smallest increase in predictive performance, while MLP, POLR and GBM approaches perform best. The use of precipitation forecast data leads to further improvements in forecast skill and except for very short lead times the extended MLP model shows the best overall performance.

View on arXiv PDF

Similar