CR LG NIJul 2, 2025

On the Effect of Ruleset Tuning and Data Imbalance on Explainable Network Security Alert Classifications: a Case-Study on DeepCASE

Koen T. W. Teuwen, Sam Baggen, Emmanuele Zambon, Luca Allodi

arXiv:2507.01571v13.6h-index: 212025 IEEE European Symposium on Security and Privacy Workshops (EuroS&PW)

Originality Synthesis-oriented

AI Analysis

This work addresses data quality issues for security operations centers, but it is incremental as it applies existing methods to a specific domain.

The study investigated how label imbalance in network intrusion alert data affects classification performance and explanation correctness using DeepCASE, showing that tuning detection rules can reduce imbalance and improve both aspects.

Automation in Security Operations Centers (SOCs) plays a prominent role in alert classification and incident escalation. However, automated methods must be robust in the presence of imbalanced input data, which can negatively affect performance. Additionally, automated methods should make explainable decisions. In this work, we evaluate the effect of label imbalance on the classification of network intrusion alerts. As our use-case we employ DeepCASE, the state-of-the-art method for automated alert classification. We show that label imbalance impacts both classification performance and correctness of the classification explanations offered by DeepCASE. We conclude tuning the detection rules used in SOCs can significantly reduce imbalance and may benefit the performance and explainability offered by alert post-processing methods such as DeepCASE. Therefore, our findings suggest that traditional methods to improve the quality of input data can benefit automation.

View on arXiv PDF

Similar