CRFeb 10, 2020

Droidetec: Android Malware Detection and Malicious Code Localization through Deep Learning

Zhuo Ma, Haoran Ge, Zhuzhu Wang, Yang Liu, Ximeng Liu

arXiv:2002.03594v111.545 citations

Originality Incremental advance

AI Analysis

This addresses the problem of manual code analysis for Android security, offering an automated deep learning solution with incremental improvements in detection and localization.

The paper tackles Android malware detection and malicious code localization by modeling applications as natural language sequences, achieving 97.22% accuracy and 91% hit rate for locating malicious code.

Android malware detection is a critical step towards building a security credible system. Especially, manual search for the potential malicious code has plagued program analysts for a long time. In this paper, we propose Droidetec, a deep learning based method for android malware detection and malicious code localization, to model an application program as a natural language sequence. Droidetec adopts a novel feature extraction method to derive behavior sequences from Android applications. Based on that, the bi-directional Long Short Term Memory network is utilized for malware detection. Each unit in the extracted behavior sequence is inventively represented as a vector, which allows Droidetec to automatically analyze the semantics of sequence segments and eventually find out the malicious code. Experiments with 9616 malicious and 11982 benign programs show that Droidetec reaches an accuracy of 97.22% and an F1-score of 98.21%. In all, Droidetec has a hit rate of 91% to properly find out malicious code segments.

View on arXiv PDF

Similar