FrUITeR: A Framework for Evaluating UI Test Reuse
This addresses the need for scalable and reproducible evaluation in UI testing research, which is incremental as it builds on existing test-reuse techniques.
The paper tackles the problem of manual, unscalable, and unreproducible evaluation in UI test reuse by introducing FrUITeR, a framework that automatically evaluates such techniques, resulting in 11,917 test reuse cases from 20 apps and revealing key findings missed by prior work.
UI testing is tedious and time-consuming due to the manual effort required. Recent research has explored opportunities for reusing existing UI tests from an app to automatically generate new tests for other apps. However, the evaluation of such techniques currently remains manual, unscalable, and unreproducible, which can waste effort and impede progress in this emerging area. We introduce FrUITeR, a framework that automatically evaluates UI test reuse in a reproducible way. We apply FrUITeR to existing test-reuse techniques on a uniform benchmark we established, resulting in 11,917 test reuse cases from 20 apps. We report several key findings aimed at improving UI test reuse that are missed by existing work.