CR AINov 24, 2025

Accuracy and Efficiency Trade-Offs in LLM-Based Malware Detection and Explanation: A Comparative Study of Parameter Tuning vs. Full Fine-Tuning

Stephen C. Gravereaux, Sheikh Rabiul Islam

arXiv:2511.19654v1

Originality Incremental advance

AI Analysis

It addresses the problem of resource-efficient and interpretable malware detection for cybersecurity analysts, though it is incremental as it builds on existing LoRA methods.

This study compared Low-Rank Adaptation (LoRA) fine-tuning to full fine-tuning for LLM-based malware detection and explanation, finding that full fine-tuning achieved up to 10% higher BLEU and ROUGE scores, but mid-range LoRA models offered competitive performance with 81% smaller model size and over 80% faster training.

This study examines whether Low-Rank Adaptation (LoRA) fine-tuned Large Language Models (LLMs) can approximate the performance of fully fine-tuned models in generating human-interpretable decisions and explanations for malware classification. Achieving trustworthy malware detection, particularly when LLMs are involved, remains a significant challenge. We developed an evaluation framework using Bilingual Evaluation Understudy (BLEU), Recall-Oriented Understudy for Gisting Evaluation (ROUGE), and Semantic Similarity Metrics to benchmark explanation quality across five LoRA configurations and a fully fine-tuned baseline. Results indicate that full fine-tuning achieves the highest overall scores, with BLEU and ROUGE improvements of up to 10% over LoRA variants. However, mid-range LoRA models deliver competitive performance exceeding full fine-tuning on two metrics while reducing model size by approximately 81% and training time by over 80% on a LoRA model with 15.5% trainable parameters. These findings demonstrate that LoRA offers a practical balance of interpretability and resource efficiency, enabling deployment in resource-constrained environments without sacrificing explanation quality. By providing feature-driven natural language explanations for malware classifications, this approach enhances transparency, analyst confidence, and operational scalability in malware detection systems.

View on arXiv PDF

Similar