On the Unusual Effectiveness of Type-Aware Operator Mutations for Testing SMT Solvers
This addresses the challenge of ensuring reliability in SMT solvers, which are critical for formal verification and automated reasoning, by providing an effective testing method that found numerous high-quality bugs, though it is incremental as it builds on mutation-based testing approaches.
The authors tackled the problem of testing SMT solvers by proposing type-aware operator mutation, which generated well-typed mutant formulas to find bugs, resulting in 819 confirmed bugs, including 184 critical soundness bugs, during one year of testing on Z3 and CVC4.
We propose type-aware operator mutation, a simple, but unusually effective approach for testing SMT solvers. The key idea is to mutate operators of conforming types within the seed formulas to generate well-typed mutant formulas. These mutant formulas are then used as the test cases for SMT solvers. We realized type-aware operator mutation within the OpFuzz tool and used it to stress-test Z3 and CVC4, two state-of-the-art SMT solvers. Type-aware operator mutations are unusually effective: During one year of extensive testing with OpFuzz, we reported 1,092 bugs on Z3's and CVC4's respective GitHub issue trackers, out of which 819 unique bugs were confirmed and 685 of the confirmed bugs were fixed by the developers. The detected bugs are highly diverse -- we found bugs of many different types (soundness bugs, invalid model bugs, crashes, etc.), logics and solver configurations. We have further conducted an in-depth study of the bugs found by OpFuzz. The study results show that the bugs found by OpFuzz are of high quality. Many of them affect core components of the SMT solvers' codebases, and some required major changes for the developers to fix. Among the 819 confirmed bugs found by OpFuzz, 184 were soundness bugs, the most critical bugs in SMT solvers, and 489 were in the default modes of the solvers. Notably, OpFuzz found 27 critical soundness bugs in CVC4, which has proved to be a very stable SMT solver.