SEAICRFeb 2

Testing Storage-System Correctness: Challenges, Fuzzing Limitations, and AI-Augmented Opportunities

arXiv:2602.02614v1
Originality Synthesis-oriented
AI Analysis

It addresses the problem of systematic testing for storage-system failures, which is critical for reliability in computing infrastructures, but is incremental as it surveys and reframes existing methods.

This survey tackles the challenge of ensuring correctness in storage systems by analyzing existing testing techniques and their limitations, highlighting mismatches between conventional fuzzing and storage semantics, and suggesting AI-augmented approaches for improvement.

Storage systems are fundamental to modern computing infrastructures, yet ensuring their correctness remains challenging in practice. Despite decades of research on system testing, many storage-system failures (including durability, ordering, recovery, and consistency violations) remain difficult to expose systematically. This difficulty stems not primarily from insufficient testing tooling, but from intrinsic properties of storage-system execution, including nondeterministic interleavings, long-horizon state evolution, and correctness semantics that span multiple layers and execution phases. This survey adopts a storage-centric view of system testing and organizes existing techniques according to the execution properties and failure mechanisms they target. We review a broad spectrum of approaches, ranging from concurrency testing and long-running workloads to crash-consistency analysis, hardware-level semantic validation, and distributed fault injection, and analyze their fundamental strengths and limitations. Within this framework, we examine fuzzing as an automated testing paradigm, highlighting systematic mismatches between conventional fuzzing assumptions and storage-system semantics, and discuss how recent artificial intelligence advances may complement fuzzing through state-aware and semantic guidance. Overall, this survey provides a unified perspective on storage-system correctness testing and outlines key challenges

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes