Assessing the Impact of Packing on Machine Learning-Based Malware Detection and Classification Systems
This addresses a critical gap in malware detection for cybersecurity, but it is incremental as it focuses on evaluating existing methods rather than proposing new ones.
The study investigated how packing affects static machine learning-based malware detection and classification systems, particularly those using visualization techniques, finding that current systems have significant limitations in handling packed malware.
The proliferation of malware, particularly through the use of packing, presents a significant challenge to static analysis and signature-based malware detection techniques. The application of packing to the original executable code renders extracting meaningful features and signatures challenging. To deal with the increasing amount of malware in the wild, researchers and anti-malware companies started harnessing machine learning capabilities with very promising results. However, little is known about the effects of packing on static machine learning-based malware detection and classification systems. This work addresses this gap by investigating the impact of packing on the performance of static machine learning-based models used for malware detection and classification, with a particular focus on those using visualisation techniques. To this end, we present a comprehensive analysis of various packing techniques and their effects on the performance of machine learning-based detectors and classifiers. Our findings highlight the limitations of current static detection and classification systems and underscore the need to be proactive to effectively counteract the evolving tactics of malware authors.