CR AI · May 7, 2024

Leveraging LSTM and GAN for Modern Malware Detection

Ishita Gupta, Sneha Kumari, Priya Jha, Mohona Ghosh

arXiv:2405.04373v1 · 11 citations · h-index: 1

Originality: Incremental advance

AI Analysis

This work addresses cybersecurity threats for practitioners by improving malware detection speed and accuracy, though it is incremental in that it combines existing deep learning methods.

The paper tackles malware detection by integrating LSTM and GAN models to generate synthetic data and improve accuracy, achieving 98% accuracy on the VirusShare dataset.

The spread of malware poses a danger to cyberspace comparable to the effect of climate change on ecosystems. Despite significant investments in cybersecurity technologies and staff training, the global community remains locked in a perpetual struggle against cyber threats. Because malware is multiform and constantly changing, cybersecurity practitioners must employ a range of detection and mitigation approaches to cope with it. Traditional techniques such as signature-based detection and behavioral analysis are slow to adapt to the rapid evolution of malware types. Consequently, this paper proposes using deep learning models, specifically LSTM networks and GANs, to improve malware detection accuracy and speed. By applying deep learning architectures to raw bytestream data, this approach provides better accuracy and performance than traditional methods. The LSTM and GAN models are integrated to generate synthetic data, which expands the training dataset and in turn improves detection accuracy. The paper uses the VirusShare dataset, which contains more than one million unique malware samples, as the training and evaluation set for the presented models. With thorough data preparation, including tokenization and augmentation, followed by model training, the LSTM and GAN models perform better than straightforward classifiers. The research achieves 98% accuracy, showing that deep learning can play a decisive role in proactive cybersecurity defense. In addition, the paper studies ensemble learning and model fusion methods as a way to reduce bias and manage model complexity.
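
The abstract mentions tokenizing raw bytestream data and enlarging the training set with synthetic samples, but gives no implementation details. The following is a minimal sketch of what those two preparation steps might look like, not the authors' code. The function names, the padding token `PAD_ID`, the fixed sequence length, and the label convention (1 for malware) are all assumptions made for illustration. The synthetic samples are assumed to come from some already-trained generator, which is not shown.

```python
from typing import List, Sequence, Tuple

# Assumption: byte values occupy token IDs 0..255, so 256 is free for padding.
PAD_ID = 256


def tokenize_bytes(raw: bytes, seq_len: int) -> List[List[int]]:
    """Split a raw byte stream into fixed-length sequences of integer tokens."""
    if seq_len <= 0:
        raise ValueError("seq_len must be positive")
    tokens = list(raw)  # each byte becomes an integer in 0..255
    chunks = []
    for start in range(0, len(tokens), seq_len):
        chunk = tokens[start:start + seq_len]
        # Right-pad the final chunk so every sequence has the same length.
        chunk += [PAD_ID] * (seq_len - len(chunk))
        chunks.append(chunk)
    return chunks


def build_training_set(
    real: Sequence[List[int]],
    real_labels: Sequence[int],
    synthetic: Sequence[List[int]],
    synthetic_label: int = 1,
) -> Tuple[List[List[int]], List[int]]:
    """Append generator-produced sequences to the real training data.

    Hypothetical helper: every synthetic sequence receives `synthetic_label`
    (assumed here to mean "malware").
    """
    if len(real) != len(real_labels):
        raise ValueError("real and real_labels must have the same length")
    samples = list(real) + list(synthetic)
    labels = list(real_labels) + [synthetic_label] * len(synthetic)
    return samples, labels
```

For example, `tokenize_bytes(b"MZ\x90\x00\x03", 4)` yields two sequences of length 4, the second padded with `PAD_ID`. Sequences in this form could then be fed to an embedding layer in front of an LSTM classifier.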
