SEAINov 13, 2024

LogLLM: Log-based Anomaly Detection Using Large Language Models

arXiv:2411.08561v552 citationsh-index: 13
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving software system reliability through more accurate anomaly detection in logs, which is an incremental advancement in the domain of log analysis.

The authors tackled the problem of log-based anomaly detection by proposing LogLLM, a framework that uses large language models to capture semantic information in logs, and demonstrated that it outperforms state-of-the-art methods across four public datasets.

Software systems often record important runtime information in logs to help with troubleshooting. Log-based anomaly detection has become a key research area that aims to identify system issues through log data, ultimately enhancing the reliability of software systems. Traditional deep learning methods often struggle to capture the semantic information embedded in log data, which is typically organized in natural language. In this paper, we propose LogLLM, a log-based anomaly detection framework that leverages large language models (LLMs). LogLLM employs BERT for extracting semantic vectors from log messages, while utilizing Llama, a transformer decoder-based model, for classifying log sequences. Additionally, we introduce a projector to align the vector representation spaces of BERT and Llama, ensuring a cohesive understanding of log semantics. Unlike conventional methods that require log parsers to extract templates, LogLLM preprocesses log messages with regular expressions, streamlining the entire process. Our framework is trained through a novel three-stage procedure designed to enhance performance and adaptability. Experimental results across four public datasets demonstrate that LogLLM outperforms state-of-the-art methods. Even when handling unstable logs, it effectively captures the semantic meaning of log messages and detects anomalies accurately.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes