LGAICRSep 23, 2024

Research on Dynamic Data Flow Anomaly Detection based on Machine Learning

arXiv:2409.14796v112 citationsh-index: 11
Originality Synthesis-oriented
AI Analysis

This addresses data security challenges for cybersecurity applications by improving anomaly detection in unbalanced data, though it appears incremental as it builds on existing unsupervised methods.

The paper tackled the problem of detecting anomalies in dynamic data flows, especially with unbalanced data, using an unsupervised learning method with clustering, and achieved high accuracy and robust performance in various scenarios.

The sophistication and diversity of contemporary cyberattacks have rendered the use of proxies, gateways, firewalls, and encrypted tunnels as a standalone defensive strategy inadequate. Consequently, the proactive identification of data anomalies has emerged as a prominent area of research within the field of data security. The majority of extant studies concentrate on sample equilibrium data, with the consequence that the detection effect is not optimal in the context of unbalanced data. In this study, the unsupervised learning method is employed to identify anomalies in dynamic data flows. Initially, multi-dimensional features are extracted from real-time data, and a clustering algorithm is utilised to analyse the patterns of the data. This enables the potential outliers to be automatically identified. By clustering similar data, the model is able to detect data behaviour that deviates significantly from normal traffic without the need for labelled data. The results of the experiments demonstrate that the proposed method exhibits high accuracy in the detection of anomalies across a range of scenarios. Notably, it demonstrates robust and adaptable performance, particularly in the context of unbalanced data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes