AISep 22, 2024

LLMs are One-Shot URL Classifiers and Explainers

Fariza Rashid, Nishavi Ranaweera, Ben Doyle, Suranga Seneviratne

arXiv:2409.14306v19.624 citationsh-index: 22

Originality Incremental advance

AI Analysis

This addresses the problem of generalization and lack of explanations in URL classification for cybersecurity, though it is incremental as it applies existing LLM methods to a new domain.

The authors tackled malicious URL classification by proposing a one-shot learning framework using Large Language Models (LLMs) with Chain-of-Thought reasoning, showing that LLMs achieve performance close to supervised models, with GPT 4-Turbo performing best, and their explanations align with supervised classifiers and are highly readable.

Malicious URL classification represents a crucial aspect of cyber security. Although existing work comprises numerous machine learning and deep learning-based URL classification models, most suffer from generalisation and domain-adaptation issues arising from the lack of representative training datasets. Furthermore, these models fail to provide explanations for a given URL classification in natural human language. In this work, we investigate and demonstrate the use of Large Language Models (LLMs) to address this issue. Specifically, we propose an LLM-based one-shot learning framework that uses Chain-of-Thought (CoT) reasoning to predict whether a given URL is benign or phishing. We evaluate our framework using three URL datasets and five state-of-the-art LLMs and show that one-shot LLM prompting indeed provides performances close to supervised models, with GPT 4-Turbo being the best model, followed by Claude 3 Opus. We conduct a quantitative analysis of the LLM explanations and show that most of the explanations provided by LLMs align with the post-hoc explanations of the supervised classifiers, and the explanations have high readability, coherency, and informativeness.

View on arXiv PDF

Similar