SE · Dec 9, 2021

A Qualitative Study on the Sources, Impacts, and Mitigation Strategies of Flaky Tests

arXiv:2112.04919v1 · 35 citations
Originality: Synthesis-oriented
AI Analysis

This paper addresses the problem of flaky tests for software developers and testers, but it is incremental, as it builds on existing qualitative research.

The study examines flaky tests in software testing by interviewing 14 practitioners to identify sources, impacts, and mitigation strategies. It finds that flakiness arises from interactions between system components, the testing infrastructure, and external factors, and that guidelines and a stable infrastructure are key to mitigation.

Test flakiness forms a major testing concern. Flaky tests manifest non-deterministic outcomes that cripple continuous integration and lead developers to investigate false alerts. Industrial reports indicate that on a large scale, the accrual of flaky tests breaks the trust in test suites and entails significant computational cost. To alleviate this, practitioners are constrained to identify flaky tests and investigate their impact. To shed light on such mitigation mechanisms, we interview 14 practitioners with the aim of identifying (i) the sources of flakiness within the testing ecosystem, (ii) the impacts of flakiness, (iii) the measures adopted by practitioners when addressing flakiness, and (iv) the automation opportunities for these measures. Our analysis shows that, besides the tests and code, flakiness stems from interactions between the system components, the testing infrastructure, and external factors. We also highlight the impact of flakiness on testing practices and product quality and show that the adoption of guidelines and a stable infrastructure are key measures in mitigating the problem.

