Disturbance-based Discretization, Differentiable IDS Channel, and an IDS-Correcting Code for DNA-based Storage
This addresses the need for universal IDS-correcting code design in DNA-based storage, offering a method for channel-customized codes, though it appears incremental as it builds on autoencoder techniques.
The paper tackles the problem of designing tailored insertion, deletion, and substitution (IDS) error-correcting codes for DNA-based storage by proposing THEA-code, an autoencoder-based approach that uses disturbance-based discretization and a differentiable IDS channel to generate codes, achieving commendable performance in complex IDS channels.
With recent advancements in next-generation data storage, especially in biological molecule-based storage, insertion, deletion, and substitution (IDS) error-correcting codes have garnered increased attention. However, a universal method for designing tailored IDS-correcting codes across varying channel settings remains underexplored. We present an autoencoder-based approach, THEA-code, aimed at efficiently generating IDS-correcting codes for complex IDS channels. In the work, a disturbance-based discretization is proposed to discretize the features of the autoencoder, and a simulated differentiable IDS channel is developed as a differentiable alternative for IDS operations. These innovations facilitate the successful convergence of the autoencoder, producing channel-customized IDS-correcting codes that demonstrate commendable performance across complex IDS channels, particularly in realistic DNA-based storage channels.