CRMar 13

Architectural Selection Framework for Synthetic Network Traffic: Quantifying the Fidelity-Utility Trade-off

Dure Adan Ammara, Jianguo Ding, Kurt Tutschku

arXiv:2410.1632615.97 citations

Predicted impact top 74% in CR · last 90 daysOriginality Incremental advance

AI Analysis

This provides security practitioners with an evidence-based guide for mitigating architectural failures in synthetic data deployment for adaptive security solutions, though it is incremental as it focuses on benchmarking existing methods.

The study tackled the problem of architectural mismatch and scalability failure in synthetic network traffic by establishing an Architectural Selection Framework to quantify the fidelity-utility trade-off, finding that GAN-based models like CTGAN and CopulaGAN consistently achieved the optimal balance across two datasets over twenty runs.

The fidelity and utility of synthetic network traffic are critically compromised by architectural mismatch across heterogeneous network datasets and prevalent scalability failure. This study addresses this challenge by establishing an Architectural Selection Framework that empirically quantifies how data structure compatibility dictates the optimal fidelity-utility trade-off. We systematically evaluate twelve generative architectures (both non-AI and AI) across two distinct data structure types: categorical-heavy NSL-KDD and continuous-flow-heavy CIC-IDS2017. Fidelity is rigorously assessed through three structural metrics (Data Structure, Correlation, and Probability Distribution Difference) to confirm structural realism before evaluating downstream utility. Our results, confirmed over twenty independent runs (N=20), demonstrate that GAN-based models (CTGAN, CopulaGAN) exhibit superior architectural robustness, consistently achieving the optimal balance of statistical fidelity and practical utility. Conversely, the framework exposes critical failure modes, i.e., statistical methods compromise structural fidelity for utility (Compromised fidelity), and modern iterative architectures, such as Diffusion Models, face prohibitive computational barriers, rendering them impractical for large-scale security deployment. This contribution provides security practitioners with an evidence-based guide for mitigating architectural failures, thereby setting a benchmark for reliable and scalable synthetic data deployment in adaptive security solutions.

View on arXiv PDF

Similar