LGMLJul 16, 2020

Using LSTM and SARIMA Models to Forecast Cluster CPU Usage

arXiv:2007.08092v1
AI Analysis

This work addresses resource allocation for public cloud providers, but it is incremental as it applies existing methods to new data without major innovations.

The study tackled forecasting CPU usage in cloud computing centers using SARIMA and LSTM models, finding that SARIMA outperformed LSTM for long-term predictions (next three days) but performed worse for short-term predictions (next hour), with LSTM being more robust to data assumptions.

As large scale cloud computing centers become more popular than individual servers, predicting future resource demand need has become an important problem. Forecasting resource need allows public cloud providers to proactively allocate or deallocate resources for cloud services. This work seeks to predict one resource, CPU usage, over both a short term and long term time scale. To gain insight into the model characteristics that best support specific tasks, we consider two vastly different architectures: the historically relevant SARIMA model and the more modern neural network, LSTM model. We apply these models to Azure data resampled to 20 minutes per data point with the goal of predicting usage over the next hour for the short-term task and for the next three days for the long-term task. The SARIMA model outperformed the LSTM for the long term prediction task, but performed poorer on the short term task. Furthermore, the LSTM model was more robust, whereas the SARIMA model relied on the data meeting certain assumptions about seasonality.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes