SE · DC · Aug 5, 2021

TRANSMUT-SPARK: Transformation Mutation for Apache Spark

arXiv:2108.02589v1 · 1 citation
Originality: Synthesis-oriented
AI Analysis

This work addresses the need for automated testing to prevent production losses in Big Data applications. It is incremental, however, because it applies an existing testing technique (mutation testing) specifically to Spark.

The authors tackled the problem of automating mutation testing for Big Data processing code in Apache Spark programs, resulting in TRANSMUT-Spark, a tool that fully automates the mutation testing process, including mutant generation, test execution, and adequacy analysis.

We propose TRANSMUT-Spark, a tool that automates the mutation testing process of Big Data processing code within Spark programs. Apache Spark is an engine for Big Data processing. It hides the complexity inherent to Big Data parallel and distributed programming and processing through built-in functions, underlying parallel processes, and data management strategies. Nonetheless, programmers must cleverly combine these functions within programs and guide the engine to use the right data management strategies to exploit the large number of computational resources required by Big Data processing and avoid substantial production losses. Many programming details in data processing code within Spark programs are error-prone and need to be tested correctly and automatically. This paper explores the application to Spark programs of mutation testing, a fault-based testing technique that relies on fault simulation to evaluate and design test sets. The paper introduces the TRANSMUT-Spark solution for testing Spark programs. TRANSMUT-Spark automates the most laborious steps of the process and fully executes the mutation testing process. The paper describes how the tool automates the mutant generation, test execution, and adequacy analysis phases of mutation testing. It also discusses the results of experiments that were carried out to validate the tool and to establish its scope and limitations.
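The three phases named in the abstract (mutant generation, test execution, adequacy analysis) can be illustrated with a minimal sketch. It is not TRANSMUT-Spark's API and does not use Spark: plain Python lists stand in for a dataset, and the pipeline, the three hand-written mutants, and the function names are all hypothetical, chosen only to show how a transformation-level fault is "killed" by a test input and how a mutation score is computed. The original program's output is used as the test oracle, which is a simplifying assumption.

```python
from typing import Callable, Dict, List, Tuple

Pipeline = Callable[[List[int]], List[int]]


def original(data: List[int]) -> List[int]:
    # Stand-in for a Spark-style chain: filter -> distinct -> sort ascending.
    return sorted(set(x for x in data if x > 0))


# Phase 1: mutant generation. Each mutant injects one small fault into a
# transformation of the original pipeline (hand-written here, hypothetical).
def mutant_negated_filter(data: List[int]) -> List[int]:
    return sorted(set(x for x in data if not (x > 0)))


def mutant_no_distinct(data: List[int]) -> List[int]:
    return sorted(x for x in data if x > 0)


def mutant_descending(data: List[int]) -> List[int]:
    return sorted(set(x for x in data if x > 0), reverse=True)


MUTANTS: Dict[str, Pipeline] = {
    "negated_filter": mutant_negated_filter,
    "no_distinct": mutant_no_distinct,
    "descending": mutant_descending,
}


def mutation_score(test_inputs: List[List[int]]) -> Tuple[List[str], float]:
    # Phase 2: test execution. A mutant is killed when at least one test
    # input makes its output differ from the original program's output.
    killed = [
        name
        for name, mutant in MUTANTS.items()
        if any(mutant(inp) != original(inp) for inp in test_inputs)
    ]
    # Phase 3: adequacy analysis. Score = killed mutants / all mutants
    # (equivalent mutants are ignored in this sketch).
    return killed, len(killed) / len(MUTANTS)


weak_tests = [[3, 1, 2]]                 # no duplicates: cannot expose no_distinct
strong_tests = [[3, 1, 2], [1, 1, -2]]   # duplicates added

print(mutation_score(weak_tests))
print(mutation_score(strong_tests))
```

With the weak test set the `no_distinct` mutant survives, which signals that the tests never exercise duplicate elimination; adding an input with duplicates kills it. This is the sense in which mutation testing is used both to evaluate a test set and to guide the design of new tests.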

