LG · Apr 8, 2024

WaveCatBoost for Probabilistic Forecasting of Regional Air Quality Data

arXiv:2404.05482v1 · 7 citations · h-index: 16 · IEEE Sensors Letters
Originality: Incremental advance
AI Analysis

This work addresses air quality forecasting for public health and urban planning, but it is incremental as it builds on existing hybrid methods.

The paper tackles real-time air pollutant concentration forecasting by proposing a WaveCatBoost architecture that combines MODWT with CatBoost, achieving superior performance on regional datasets from CPCB and LAQS compared to state-of-the-art methods.

Accurate and reliable air quality forecasting is essential for protecting public health, sustainable development, pollution control, and enhanced urban planning. This letter presents a novel WaveCatBoost architecture designed to forecast the real-time concentrations of air pollutants by combining the maximal overlap discrete wavelet transform (MODWT) with the CatBoost model. This hybrid approach efficiently decomposes a time series into high-frequency and low-frequency components, thereby separating signal from noise and improving prediction accuracy and robustness. Evaluation on two distinct regional datasets, from the Central Air Pollution Control Board (CPCB) sensor network and a low-cost air quality sensor system (LAQS), underscores the superior performance of our proposed methodology in real-time forecasting compared to state-of-the-art statistical and deep learning architectures. Moreover, we employ a conformal prediction strategy to provide probabilistic bands with our forecasts.
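The pipeline described in the abstract (wavelet decomposition, a regressor on the components, conformal bands) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a Haar filter for the MODWT, a synthetic hourly series in place of the CPCB/LAQS data, and an ordinary least-squares model as a stand-in for CatBoost so that the example needs only NumPy. The names `haar_modwt` and `lagged_features`, and the values of `levels`, `lags`, and `alpha`, are hypothetical choices made for illustration.

```python
import numpy as np

def haar_modwt(x, levels):
    """Circular Haar MODWT: returns [W_1, ..., W_J, V_J] (details, then smooth)."""
    v = np.asarray(x, dtype=float)
    details = []
    for j in range(1, levels + 1):
        shifted = np.roll(v, 2 ** (j - 1))   # v[t - 2^(j-1)], wrapping around
        details.append((v - shifted) / 2.0)  # high-frequency part at level j
        v = (v + shifted) / 2.0              # low-frequency part passed on
    return details + [v]

def lagged_features(components, lags, start):
    """One row per target time t >= start: the last `lags` values of each component."""
    n = len(components[0])
    return np.array([np.concatenate([c[t - lags:t] for c in components])
                     for t in range(start, n)])

# Synthetic stand-in for a pollutant series (assumption, not real sensor data).
rng = np.random.default_rng(0)
t = np.arange(600)
series = 50 + 10 * np.sin(2 * np.pi * t / 24) + rng.normal(0, 2, size=t.size)

levels, lags, alpha = 3, 4, 0.1
components = haar_modwt(series, levels)

# Skip early points whose coefficients wrap around to the end of the series.
start = lags + 2 ** levels - 1
X = lagged_features(components, lags, start)
X = np.column_stack([np.ones(len(X)), X])    # intercept column
y = series[start:]

# Chronological split: fit / calibrate / test.
n = len(y)
n_tr, n_cal = int(0.6 * n), int(0.2 * n)
coef, *_ = np.linalg.lstsq(X[:n_tr], y[:n_tr], rcond=None)  # stand-in for CatBoost
pred = X @ coef

# Split conformal: band half-width is a quantile of calibration residuals.
scores = np.abs(y[n_tr:n_tr + n_cal] - pred[n_tr:n_tr + n_cal])
k = int(np.ceil((n_cal + 1) * (1 - alpha)))
q = np.sort(scores)[min(k, n_cal) - 1]

y_test = y[n_tr + n_cal:]
lower, upper = pred[n_tr + n_cal:] - q, pred[n_tr + n_cal:] + q
coverage = np.mean((y_test >= lower) & (y_test <= upper))
print(f"band half-width: {q:.2f}, empirical coverage: {coverage:.2f}")
```

With the Haar filter, the detail and smooth components sum back to the original series, which is a convenient sanity check. Because time series are not exchangeable, the nominal coverage of the split conformal band holds only approximately here.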

Code Implementations: 1 repo
