CRMar 20, 2025

Using Data Redundancy Techniques to Detect and Correct Errors in Logical Data

arXiv:2503.15881
Originality Synthesis-oriented
AI Analysis

It addresses the problem of data integrity for logical data, but the approach is incremental, applying existing RAID concepts to a new domain.

The paper adapts RAID-like data redundancy techniques for logical data, demonstrating robust recovery of arbitrary faults in large archive files using a small fraction of redundant data.

Data redundancy techniques have been tested in several different applications to provide fault tolerance and performance gains. The use of these techniques is mostly seen at the hardware, device driver, or file system level. In practice, the use of data integrity techniques with logical data has largely been limited to verifying the integrity of transferred files using cryptographic hashes. In this paper, we study the RAID scheme used with disk arrays and adapt it for use with logical data. An implementation for such a system is devised in theory and implemented in software, providing the specifications for the procedures and file formats used. Rigorous experimentation is conducted to test the effectiveness of the developed system for multiple use cases. With computer-generated benchmarks and simulated experiments, the system demonstrates robust performance in recovering arbitrary faults in large archive files only using a small fraction of redundant data. This was achieved by leveraging computing power for the process of data recovery.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes