SEDCAug 17, 2018

Learning-based Automatic Parameter Tuning for Big Data Analytics Frameworks

arXiv:1808.06008v11 citations
Originality Incremental advance
AI Analysis

This work addresses the tuning issue that overwhelms users of big data analytics frameworks, offering a practical solution with significant performance gains, though it is incremental in nature.

The paper tackles the challenge of automatic parameter tuning for big data analytics frameworks by introducing AutoTune, which constructs a smaller-scale testbed to generate more samples and train a better prediction model, resulting in an average improvement of 63.70% over default configurations and 6%-23% over state-of-the-art tuning algorithms.

Big data analytics frameworks (BDAFs) have been widely used for data processing applications. These frameworks provide a large number of configuration parameters to users, which leads to a tuning issue that overwhelms users. To address this issue, many automatic tuning approaches have been proposed. However, it remains a critical challenge to generate enough samples in a high-dimensional parameter space within a time constraint. In this paper, we present AutoTune--an automatic parameter tuning system that aims to optimize application execution time on BDAFs. AutoTune first constructs a smaller-scale testbed from the production system so that it can generate more samples, and thus train a better prediction model, under a given time constraint. Furthermore, the AutoTune algorithm produces a set of samples that can provide a wide coverage over the high-dimensional parameter space, and searches for more promising configurations using the trained prediction model. AutoTune is implemented and evaluated using the Spark framework and HiBench benchmark deployed on a public cloud. Extensive experimental results illustrate that AutoTune improves on default configurations by 63.70% on average, and on the five state-of-the-art tuning algorithms by 6%-23%.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes