CRAILGNIJul 17, 2025

PHASE: Passive Human Activity Simulation Evaluation

arXiv:2507.13505v1h-index: 19
Originality Incremental advance
AI Analysis

This addresses the problem of unrealistic human behavior in cybersecurity simulations for practitioners, though it is incremental as it builds on existing network monitoring and machine learning techniques.

The paper tackles the lack of quantitative methods to assess behavioral fidelity in cybersecurity simulations by introducing PHASE, a machine learning framework that distinguishes human from non-human activity in network logs with over 90% accuracy, leading to improved synthetic user personas.

Cybersecurity simulation environments, such as cyber ranges, honeypots, and sandboxes, require realistic human behavior to be effective, yet no quantitative method exists to assess the behavioral fidelity of synthetic user personas. This paper presents PHASE (Passive Human Activity Simulation Evaluation), a machine learning framework that analyzes Zeek connection logs and distinguishes human from non-human activity with over 90\% accuracy. PHASE operates entirely passively, relying on standard network monitoring without any user-side instrumentation or visible signs of surveillance. All network activity used for machine learning is collected via a Zeek network appliance to avoid introducing unnecessary network traffic or artifacts that could disrupt the fidelity of the simulation environment. The paper also proposes a novel labeling approach that utilizes local DNS records to classify network traffic, thereby enabling machine learning analysis. Furthermore, we apply SHAP (SHapley Additive exPlanations) analysis to uncover temporal and behavioral signatures indicative of genuine human users. In a case study, we evaluate a synthetic user persona and identify distinct non-human patterns that undermine behavioral realism. Based on these insights, we develop a revised behavioral configuration that significantly improves the human-likeness of synthetic activity yielding a more realistic and effective synthetic user persona.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes