CRJun 3, 2020

SQUIRREL: Testing Database Management Systems with Language Validity and Coverage Feedback

arXiv:2006.02398v1138 citations
Originality Incremental advance
AI Analysis

This addresses a critical issue for DBMS developers and users by improving software reliability and security, though it is an incremental advancement over existing fuzzing techniques.

The paper tackles the problem of effectively testing database management systems (DBMSs) by developing Squirrel, a fuzzing framework that ensures language validity and uses coverage feedback, resulting in the discovery of 63 bugs across four DBMSs and achieving up to 243.9x higher semantic correctness and 10.9x more edge coverage than state-of-the-art tools.

Fuzzing is an increasingly popular technique for verifying software functionalities and finding security vulnerabilities. However, current mutation-based fuzzers cannot effectively test database management systems (DBMSs), which strictly check inputs for valid syntax and semantics. Generation-based testing can guarantee the syntax correctness of the inputs, but it does not utilize any feedback, like code coverage, to guide the path exploration. In this paper, we develop Squirrel, a novel fuzzing framework that considers both language validity and coverage feedback to test DBMSs. We design an intermediate representation (IR) to maintain SQL queries in a structural and informative manner. To generate syntactically correct queries, we perform type-based mutations on IR, including statement insertion, deletion and replacement. To mitigate semantic errors, we analyze each IR to identify the logical dependencies between arguments, and generate queries that satisfy these dependencies. We evaluated Squirrel on four popular DBMSs: SQLite, MySQL, PostgreSQL and MariaDB. Squirrel found 51 bugs in SQLite, 7 in MySQL and 5 in MariaDB. 52 of the bugs are fixed with 12 CVEs assigned. In our experiment, Squirrel achieves 2.4x-243.9x higher semantic correctness than state-of-the-art fuzzers, and explores 2.0x-10.9x more new edges than mutation-based tools. These results show that Squirrel is effective in finding memory errors of database management systems.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes