SESep 10, 2019

Suggestions on Test Suite Improvements with Automatic Infection and Propagation Analysis

arXiv:1909.04770v18 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of improving test suite effectiveness for software developers, though it appears incremental as it builds on existing mutation testing concepts.

The researchers tackled the problem of identifying why test suites fail to detect extreme transformations in software, developing a tool called Reneri that analyzes runtime differences to suggest improvements. They evaluated Reneri on 15 projects with 312 undetected transformations, finding it generated suggestions for all cases and identified that 63% of failures were due to observability and weak oracle issues.

An extreme transformation removes the body of a method that is reached by one test case at least. If the test suite passes on the original program and still passes after the extreme transformation, the transformation is said to be undetected, and the test suite needs to be improved. In this work we propose a technique to automatically determine which of the following three reasons prevent the detection of the extreme transformation is : the test inputs are not sufficient to infect the state of the program; the infection does not propagate to the test cases; the test cases have a weak oracle that does not observe the infection. We have developed Reneri, a tool that observes the program under test and the test suite in order to determine runtime differences between test runs on the original and the transformed method. The observations gathered during the analysis are processed by Reneri to suggest possible improvements to the developers. We evaluate Reneri on 15 projects and a total of 312 undetected extreme transformations. The tool is able to generate a suggestion for each each undetected transformation. For 63% of the cases, the existing test cases can infect the program state, meaning that undetected transformations are mostly due to observability and weak oracle issues. Interviews with developers confirm the relevance of the suggested improvements and experiments with state of the art automatic test generation tools indicate that no tool can improve the existing test suites to fix all undetected transformations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes