LGCRJan 30, 2023

Reassessing feature-based Android malware detection in a contemporary context

arXiv:2301.12778v47 citationsh-index: 23
Originality Synthesis-oriented
AI Analysis

This work provides practical guidance for Android malware detection practitioners by showing that simple, fast approaches remain effective despite increasing focus on more complex models.

The researchers reimplemented 18 foundational studies on feature-based machine learning for Android malware detection and evaluated them on a contemporary dataset of 124,000 applications, finding that these approaches can still achieve detection accuracies beyond 98% with simpler models often outperforming more complex ones.

We report the findings of a reimplementation of 18 foundational studies in feature-based machine learning for Android malware detection, published during the period 2013-2023. These studies are reevaluated on a level playing field using a contemporary Android environment and a balanced dataset of 124,000 applications. Our findings show that feature-based approaches can still achieve detection accuracies beyond 98%, despite a considerable increase in the size of the underlying Android feature sets. We observe that features derived through dynamic analysis yield only a small benefit over those derived from static analysis, and that simpler models often out-perform more complex models. We also find that API calls and opcodes are the most productive static features within our evaluation context, network traffic is the most predictive dynamic feature, and that ensemble models provide an efficient means of combining models trained on static and dynamic features. Together, these findings suggest that simple, fast machine learning approaches can still be an effective basis for malware detection, despite the increasing focus on slower, more expensive machine learning models in the literature.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes