CR AI CL LG SYAug 26, 2025

FALCON: Autonomous Cyber Threat Intelligence Mining with LLMs for IDS Rule Generation

Shaswata Mitra, Azim Bazarov, Martin Duclos, Sudip Mittal, Aritran Piplai, Md Rayhanur Rahman, Edward Zieglar, Shahram Rahimi

arXiv:2508.18684v15 citationsh-index: 15

Originality Incremental advance

AI Analysis

This addresses the need for faster threat mitigation in cybersecurity by automating IDS rule generation, though it appears incremental as it applies existing LLM agentic systems to a specific domain.

The paper tackles the problem of delayed rule updates in signature-based Intrusion Detection Systems (IDS) by introducing FALCON, an autonomous agentic framework that generates deployable IDS rules from Cyber Threat Intelligence (CTI) data in real-time, achieving 95% accuracy in rule generation with 84% inter-rater agreement among analysts.

Signature-based Intrusion Detection Systems (IDS) detect malicious activities by matching network or host activity against predefined rules. These rules are derived from extensive Cyber Threat Intelligence (CTI), which includes attack signatures and behavioral patterns obtained through automated tools and manual threat analysis, such as sandboxing. The CTI is then transformed into actionable rules for the IDS engine, enabling real-time detection and prevention. However, the constant evolution of cyber threats necessitates frequent rule updates, which delay deployment time and weaken overall security readiness. Recent advancements in agentic systems powered by Large Language Models (LLMs) offer the potential for autonomous IDS rule generation with internal evaluation. We introduce FALCON, an autonomous agentic framework that generates deployable IDS rules from CTI data in real-time and evaluates them using built-in multi-phased validators. To demonstrate versatility, we target both network (Snort) and host-based (YARA) mediums and construct a comprehensive dataset of IDS rules with their corresponding CTIs. Our evaluations indicate FALCON excels in automatic rule generation, with an average of 95% accuracy validated by qualitative evaluation with 84% inter-rater agreement among multiple cybersecurity analysts across all metrics. These results underscore the feasibility and effectiveness of LLM-driven data mining for real-time cyber threat mitigation.

View on arXiv PDF

Similar