CRLGMay 7

TUANDROMD-X: Advanced Entropy and Visual Analytics Dataset for Enhanced Malware Detection and Classification

arXiv:2605.067182.4
Predicted impact top 85% in CR · last 90 daysOriginality Synthesis-oriented
AI Analysis

For malware researchers and cybersecurity experts, this dataset provides a new resource to develop and test machine learning-based detection methods, addressing a known bottleneck in the field.

The paper introduces TUANDROMD-X, a multiclass malware dataset with visual and entropy-based features to address the lack of high-quality datasets for malware detection. The dataset enables faster and better malware detection systems through static analysis.

Malware and malware-based attacks are becoming more prevalent and complex. Attackers regularly come up with new techniques that have the ability to evade conventional and signature-based malware defense. In order to address such threats, there is an increasing demand for advanced and better defense solutions. Machine learning-based techniques are efficiently capable of defending against malware and malware-based attacks. Nevertheless, creating and efficiently testing such techniques demand high-quality datasets having samples of various malware families as well as goodware. The lack of such datasets continues to be a major bottleneck in malware research. In this paper, we introduce TUANDROMD-X, a multiclass malware dataset with visual and entropy-based features of each sample, distinctly identifying malware from goodware. The dataset is created based on static analysis, lowering the overhead that comes with high feature engineering and dynamic analysis. As a result, TUANDROMD-X facilitates researchers and cyber-security experts to design faster and better malware detection systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes