LG AIDec 5, 2021

Probabilistic Deep Learning to Quantify Uncertainty in Air Quality Forecasting

Abdulmajid Murad, Frank Alexander Kraemer, Kerstin Bach, Gavin Taylor

arXiv:2112.02622v13.119 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This addresses the need for trustworthy air quality predictions for environmental monitoring and public health, though it is incremental as it applies existing uncertainty quantification methods to this domain.

This work tackles the problem of quantifying uncertainty in data-driven air quality forecasts by applying state-of-the-art probabilistic deep learning techniques, demonstrating that Bayesian neural networks provide more reliable uncertainty estimates while scalable methods like deep ensembles and MC dropout perform well with tradeoffs.

Data-driven forecasts of air quality have recently achieved more accurate short-term predictions. Despite their success, most of the current data-driven solutions lack proper quantifications of model uncertainty that communicate how much to trust the forecasts. Recently, several practical tools to estimate uncertainty have been developed in probabilistic deep learning. However, there have not been empirical applications and extensive comparisons of these tools in the domain of air quality forecasts. Therefore, this work applies state-of-the-art techniques of uncertainty quantification in a real-world setting of air quality forecasts. Through extensive experiments, we describe training probabilistic models and evaluate their predictive uncertainties based on empirical performance, reliability of confidence estimate, and practical applicability. We also propose improving these models using "free" adversarial training and exploiting temporal and spatial correlation inherent in air quality data. Our experiments demonstrate that the proposed models perform better than previous works in quantifying uncertainty in data-driven air quality forecasts. Overall, Bayesian neural networks provide a more reliable uncertainty estimate but can be challenging to implement and scale. Other scalable methods, such as deep ensemble, Monte Carlo (MC) dropout, and stochastic weight averaging-Gaussian (SWAG), can perform well if applied correctly but with different tradeoffs and slight variations in performance metrics. Finally, our results show the practical impact of uncertainty estimation and demonstrate that, indeed, probabilistic models are more suitable for making informed decisions. Code and dataset are available at \url{https://github.com/Abdulmajid-Murad/deep_probabilistic_forecast}

View on arXiv PDF Code

Similar