Contrastive Learning for Correlating Network Incidents
This addresses the need for scalable incident correlation in network monitoring, though it appears incremental as it applies an existing learning technique to a specific domain.
The paper tackles the problem of automatically correlating network incidents for internet service providers by proposing a self-supervised contrastive learning method, achieving high precision in experiments on real-world data.
Internet service providers monitor their networks to detect, triage, and remediate service impairments. When an incident is detected, it is important to determine whether similar incidents have occurred in the past or are happening concurrently elsewhere in the network. Manual correlation of such incidents is infeasible due to the scale of the networks under observation, making automated correlation a necessity. This paper presents a self-supervised learning method for similarity-based correlation of network situations. Using this method, a deep neural network is trained on a large unlabeled dataset of network situations using contrastive learning. High precision achieved in experiments on real-world network monitoring data suggests that contrastive learning is a promising approach to network incident correlation.