CRLGSep 24, 2021

The More, the Better? A Study on Collaborative Machine Learning for DGA Detection

arXiv:2109.11830v1
Originality Synthesis-oriented
AI Analysis

This work addresses DGA detection for cybersecurity, but it is incremental as it builds on existing collaborative learning methods.

The study investigated collaborative machine learning for detecting domain generation algorithms (DGAs) in botnets, showing that it can reduce false positive rates by up to 51.7% in real-world scenarios.

Domain generation algorithms (DGAs) prevent the connection between a botnet and its master from being blocked by generating a large number of domain names. Promising single-data-source approaches have been proposed for separating benign from DGA-generated domains. Collaborative machine learning (ML) can be used in order to enhance a classifier's detection rate, reduce its false positive rate (FPR), and to improve the classifier's generalization capability to different networks. In this paper, we complement the research area of DGA detection by conducting a comprehensive collaborative learning study, including a total of 13,440 evaluation runs. In two real-world scenarios we evaluate a total of eleven different variations of collaborative learning using three different state-of-the-art classifiers. We show that collaborative ML can lead to a reduction in FPR by up to 51.7%. However, while collaborative ML is beneficial for DGA detection, not all approaches and classifier types profit equally. We round up our comprehensive study with a thorough discussion of the privacy threats implicated by the different collaborative ML approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes