MMAICVIVNov 7, 2022

Using Set Covering to Generate Databases for Holistic Steganalysis

arXiv:2211.03447v28 citationsh-index: 33Has Code
AI Analysis

This addresses the operational challenge of steganalysis performance degradation due to mismatched cover sources, though it is incremental in optimizing pipeline selection.

The paper tackled the Cover Source Mismatch (CSM) problem in steganalysis by developing a methodology using a set-covering greedy algorithm to select representative processing pipelines, which improved performance over random selection or using all pipelines for out-of-distribution covers.

Within an operational framework, covers used by a steganographer are likely to come from different sensors and different processing pipelines than the ones used by researchers for training their steganalysis models. Thus, a performance gap is unavoidable when it comes to out-of-distributions covers, an extremely frequent scenario called Cover Source Mismatch (CSM). Here, we explore a grid of processing pipelines to study the origins of CSM, to better understand it, and to better tackle it. A set-covering greedy algorithm is used to select representative pipelines minimizing the maximum regret between the representative and the pipelines within the set. Our main contribution is a methodology for generating relevant bases able to tackle operational CSM. Experimental validation highlights that, for a given number of training samples, our set covering selection is a better strategy than selecting random pipelines or using all the available pipelines. Our analysis also shows that parameters as denoising, sharpening, and downsampling are very important to foster diversity. Finally, different benchmarks for classical and wild databases show the good generalization property of the extracted databases. Additional resources are available at github.com/RonyAbecidan/HolisticSteganalysisWithSetCovering.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes