CRSep 8, 2021

Unsupervised Detection and Clustering of Malicious TLS Flows

Gibran Gomez, Platon Kotzias, Matteo Dell'Amico, Leyla Bilge, Juan Caballero

arXiv:2109.03878v33.8

Originality Incremental advance

AI Analysis

This work addresses the problem of identifying and categorizing encrypted malicious traffic for network security, offering an unsupervised solution that avoids the need for labeled data, though it is incremental as it builds on prior supervised methods.

The paper tackles the problem of detecting and clustering malicious TLS flows in network traffic, which is challenging due to encryption, by proposing an unsupervised approach that clusters similar flows and builds a detector; it achieves an F1 score of 0.993 in clustering and a false detection rate of 0.032% over four months, outperforming state-of-the-art methods.

Malware abuses TLS to encrypt its malicious traffic, preventing examination by content signatures and deep packet inspection. Network detection of malicious TLS flows is an important, but challenging, problem. Prior works have proposed supervised machine learning detectors using TLS features. However, by trying to represent all malicious traffic, supervised binary detectors produce models that are too loose, thus introducing errors. Furthermore, they do not distinguish flows generated by different malware. On the other hand, supervised multi-class detectors produce tighter models and can classify flows by malware family, but require family labels, which are not available for many samples. To address these limitations, this work proposes a novel unsupervised approach to detect and cluster malicious TLS flows. Our approach takes as input network traces from sandboxes. It clusters similar TLS flows using 90 features that capture properties of the TLS client, TLS server, certificate, and encrypted payload; and uses the clusters to build an unsupervised detector that can assign a malicious flow to the cluster it belongs to, or determine it is benign. We evaluate our approach using 972K traces from a commercial sandbox and 35M TLS flows from a research network. Our clustering shows very high precision and recall with an F1 score of 0.993. We compare our unsupervised detector with two state-of-the-art approaches, showing that it outperforms both. The false detection rate of our detector is 0.032% measured over four months of traffic.

View on arXiv PDF

Similar