MoRSE: Bridging the Gap in Cybersecurity Expertise with Retrieval Augmented Generation
This addresses the gap in cybersecurity expertise for users needing accurate, up-to-date information, though it is incremental as it builds on existing RAG methods.
The paper tackles the problem of providing comprehensive cybersecurity expertise by introducing MoRSE, a specialized AI chatbot that uses parallel retrieval augmented generation systems, resulting in over 10% improvement in answer relevance and correctness compared to state-of-the-art models like GPT-4 on 600 cybersecurity questions.
In this paper, we introduce MoRSE (Mixture of RAGs Security Experts), the first specialised AI chatbot for cybersecurity. MoRSE aims to provide comprehensive and complete knowledge about cybersecurity. MoRSE uses two RAG (Retrieval Augmented Generation) systems designed to retrieve and organize information from multidimensional cybersecurity contexts. MoRSE differs from traditional RAGs by using parallel retrievers that work together to retrieve semantically related information in different formats and structures. Unlike traditional Large Language Models (LLMs) that rely on Parametric Knowledge Bases, MoRSE retrieves relevant documents from Non-Parametric Knowledge Bases in response to user queries. Subsequently, MoRSE uses this information to generate accurate answers. In addition, MoRSE benefits from real-time updates to its knowledge bases, enabling continuous knowledge enrichment without retraining. We have evaluated the effectiveness of MoRSE against other state-of-the-art LLMs, evaluating the system on 600 cybersecurity specific questions. The experimental evaluation has shown that the improvement in terms of relevance and correctness of the answer is more than 10\% compared to known solutions such as GPT-4 and Mixtral 7x8.