CR | AI | LG | Jan 19

Diffusion-Driven Synthetic Tabular Data Generation for Enhanced DoS/DDoS Attack Classification

arXiv:2601.13197v1
Originality: Synthesis-oriented
AI Analysis

This work addresses class imbalance in network security data, but the contribution is incremental: it applies an existing method (TabDDPM) to a specific domain.

The paper tackles class imbalance in network intrusion detection by using TabDDPM to generate synthetic minority-class samples from the CIC-IDS2017 dataset, and reports near-perfect recall for underrepresented attack classes with an ANN classifier.

Class imbalance refers to a situation where certain classes in a dataset have significantly fewer samples than others, leading to biased model performance. This paper addresses class imbalance in network intrusion detection using Tabular Denoising Diffusion Probabilistic Models (TabDDPM) for data augmentation. Our approach synthesizes high-fidelity minority-class samples from the CIC-IDS2017 dataset through an iterative denoising process. For the minority classes with fewer samples, synthetic samples were generated and merged with the original dataset. The augmented training data enables an ANN classifier to achieve near-perfect recall on previously underrepresented attack classes. These results establish diffusion models as an effective solution for tabular data imbalance in security domains, with potential applications in fraud detection and medical diagnostics.
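The generate-and-merge step described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the function names (`synthetic_counts`, `placeholder_sampler`, `augment`), the toy feature values, and the choice of balancing every class up to the majority-class size are all hypothetical, since the abstract does not state a target count. The sampler here is a simple jitter-based stand-in and is not TabDDPM; in practice a trained generative model would take its place.

```python
import random
from collections import Counter


def synthetic_counts(labels, target=None):
    """Return how many synthetic rows each class needs to reach `target`.

    Assumption: `target` defaults to the majority-class size.
    """
    counts = Counter(labels)
    if target is None:
        target = max(counts.values())
    return {cls: target - n for cls, n in counts.items() if n < target}


def placeholder_sampler(rows, n, rng):
    """Hypothetical stand-in for a trained generator (NOT TabDDPM).

    Resamples real rows of one class and adds small Gaussian jitter.
    """
    samples = []
    for _ in range(n):
        base = rng.choice(rows)
        samples.append([value + rng.gauss(0.0, 0.01) for value in base])
    return samples


def augment(features, labels, sampler=placeholder_sampler, target=None, seed=0):
    """Merge synthetic minority-class rows with the original rows."""
    rng = random.Random(seed)

    # Group the real rows by class so the sampler only sees one class at a time.
    by_class = {}
    for row, label in zip(features, labels):
        by_class.setdefault(label, []).append(row)

    aug_features, aug_labels = list(features), list(labels)
    for cls, n_needed in synthetic_counts(labels, target).items():
        aug_features.extend(sampler(by_class[cls], n_needed, rng))
        aug_labels.extend([cls] * n_needed)
    return aug_features, aug_labels


# Toy data: two numeric features per row; class names are illustrative only.
features = [[0.1, 0.2]] * 6 + [[0.9, 0.8]] * 2
labels = ["BENIGN"] * 6 + ["DoS"] * 2

aug_features, aug_labels = augment(features, labels)
print(Counter(aug_labels))
```

In a sketch like this, augmentation would normally be applied to the training split only, so that recall on the minority classes is measured on real, held-out samples.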

