CRJan 5, 2021

LSSD: a Controlled Large JPEG Image Database for Deep-Learning-based Steganalysis "into the Wild"

arXiv:2101.01495v18 citations
Originality Incremental advance
AI Analysis

This database provides a larger and more diverse dataset for the steganalysis community, addressing the problem of limited image diversity in existing datasets for researchers developing and evaluating steganalysis algorithms.

The authors created LSSD, a large JPEG image database comprising 2 million color and grayscale images. This database aims to address the limitation of small, less diverse image datasets in steganalysis, enabling large-scale analysis of steganalysis algorithms.

For many years, the image databases used in steganalysis have been relatively small, i.e. about ten thousand images. This limits the diversity of images and thus prevents large-scale analysis of steganalysis algorithms. In this paper, we describe a large JPEG database composed of 2 million colour and grey-scale images. This database, named LSSD for Large Scale Steganalysis Database, was obtained thanks to the intensive use of \enquote{controlled} development procedures. LSSD has been made publicly available, and we aspire it could be used by the steganalysis community for large-scale experiments. We introduce the pipeline used for building various image database versions. We detail the general methodology that can be used to redevelop the entire database and increase even more the diversity. We also discuss computational cost and storage cost in order to develop images.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes