CRAIFeb 7, 2025

Learning Temporal Invariance in Android Malware Detectors

arXiv:2502.05098v27 citationsh-index: 8
Originality Highly original
AI Analysis

This work addresses a significant problem for Android malware detector developers and users, providing a solution to improve the temporal invariance of detectors.

The authors tackled the problem of Android malware detectors degrading over time due to distribution drift, and their proposed framework TIF achieved state-of-the-art results, particularly in early deployment stages. TIF outperformed existing methods on a decade-long dataset.

Learning-based Android malware detectors degrade over time due to natural distribution drift caused by malware variants and new families. This paper systematically investigates the challenges classifiers trained with empirical risk minimization (ERM) face against such distribution shifts and attributes their shortcomings to their inability to learn stable discriminative features. Invariant learning theory offers a promising solution by encouraging models to generate stable representations crossing environments that expose the instability of the training set. However, the lack of prior environment labels, the diversity of drift factors, and low-quality representations caused by diverse families make this task challenging. To address these issues, we propose TIF, the first temporal invariant training framework for malware detection, which aims to enhance the ability of detectors to learn stable representations across time. TIF organizes environments based on application observation dates to reveal temporal drift, integrating specialized multi-proxy contrastive learning and invariant gradient alignment to generate and align environments with high-quality, stable representations. TIF can be seamlessly integrated into any learning-based detector. Experiments on a decade-long dataset show that TIF excels, particularly in early deployment stages, addressing real-world needs and outperforming state-of-the-art methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes