CLAIApr 29, 2025

Improving Phishing Email Detection Performance of Small Large Language Models

arXiv:2505.00034v23 citationsh-index: 3
Originality Incremental advance
AI Analysis

This work addresses the need for efficient phishing detection using smaller, more accessible LLMs, though it is incremental as it builds on existing methods to enhance performance.

The paper tackled the problem of poor phishing email detection performance by small large language models (LLMs) with around 3 billion parameters, and through methods like prompt engineering and fine-tuning, it significantly improved accuracy and F1 scores on datasets such as SpamAssassin and CEAS_08, with the models also showing strong transferability to unseen datasets.

Large language models(LLMs) have demonstrated remarkable performance on many natural language processing(NLP) tasks and have been employed in phishing email detection research. However, in current studies, well-performing LLMs typically contain billions or even tens of billions of parameters, requiring enormous computational resources. To reduce computational costs, we investigated the effectiveness of small-parameter LLMs for phishing email detection. These LLMs have around 3 billion parameters and can run on consumer-grade GPUs. However, small LLMs often perform poorly in phishing email detection task. To address these issues, we designed a set of methods including Prompt Engineering, Explanation Augmented Fine-tuning, and Model Ensemble to improve phishing email detection capabilities of small LLMs. We validated the effectiveness of our approach through experiments, significantly improving both accuracy and F1 score on the SpamAssassin and CEAS\_08 datasets. Furthermore, the fine-tuned models demonstrated strong transferability, achieving robust performance across multiple unseen phishing datasets, outperforming traditional baselines and approaching standard-sized LLMs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes