LG NIMay 21, 2023

Many or Few Samples? Comparing Transfer, Contrastive and Meta-Learning in Encrypted Traffic Classification

Idio Guarino, Chao Wang, Alessandro Finamore, Antonio Pescape, Dario Rossi

arXiv:2305.12432v26.614 citations

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge of encrypted traffic classification for network security, but it is incremental as it compares existing methods without introducing new techniques.

The paper tackled the problem of traffic classification with limited labeled data by comparing 16 methods including transfer, contrastive, and meta-learning, finding that contrastive learning performed best and meta-learning worst, with DL methods matching tree-based models on small tasks.

The popularity of Deep Learning (DL), coupled with network traffic visibility reduction due to the increased adoption of HTTPS, QUIC and DNS-SEC, re-ignited interest towards Traffic Classification (TC). However, to tame the dependency from task-specific large labeled datasets we need to find better ways to learn representations that are valid across tasks. In this work we investigate this problem comparing transfer learning, meta-learning and contrastive learning against reference Machine Learning (ML) tree-based and monolithic DL models (16 methods total). Using two publicly available datasets, namely MIRAGE19 (40 classes) and AppClassNet (500 classes), we show that (i) using large datasets we can obtain more general representations, (ii) contrastive learning is the best methodology and (iii) meta-learning the worst one, and (iv) while ML tree-based cannot handle large tasks but fits well small tasks, by means of reusing learned representations, DL methods are reaching tree-based models performance also for small tasks.

View on arXiv PDF

Similar