SILGMLJun 8, 2018

Discovering Signals from Web Sources to Predict Cyber Attacks

arXiv:1806.03342v130 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the growing threat of cyber attacks for organizations and individuals by providing a predictive defense system, though it appears incremental as it applies existing machine learning techniques to this domain.

The paper tackles the problem of predicting cyber attacks by analyzing digital traces left by hackers on web platforms, using deep neural networks and autoregressive time series models. The result shows a significant increase in F1 score for top signals in real-world forecasting tasks.

Cyber attacks are growing in frequency and severity. Over the past year alone we have witnessed massive data breaches that stole personal information of millions of people and wide-scale ransomware attacks that paralyzed critical infrastructure of several countries. Combating the rising cyber threat calls for a multi-pronged strategy, which includes predicting when these attacks will occur. The intuition driving our approach is this: during the planning and preparation stages, hackers leave digital traces of their activities on both the surface web and dark web in the form of discussions on platforms like hacker forums, social media, blogs and the like. These data provide predictive signals that allow anticipating cyber attacks. In this paper, we describe machine learning techniques based on deep neural networks and autoregressive time series models that leverage external signals from publicly available Web sources to forecast cyber attacks. Performance of our framework across ground truth data over real-world forecasting tasks shows that our methods yield a significant lift or increase of F1 for the top signals on predicted cyber attacks. Our results suggest that, when deployed, our system will be able to provide an effective line of defense against various types of targeted cyber attacks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes