AIJul 8, 2024

PORCA: Root Cause Analysis with Partially Observed Data

arXiv:2407.05869v25 citationsh-index: 7
Originality Incremental advance
AI Analysis

This work addresses the problem of reliable fault diagnosis in complex systems for domains like IT or industrial monitoring, but it is incremental as it builds on existing RCA methods by handling partial observation.

The paper tackles the problem of root cause analysis (RCA) in systems with partially observed data, which previous methods assumed full observation, leading to unreliable results. It proposes PORCA, a framework that addresses unobserved confounders and heterogeneity, showing effectiveness and superiority in experiments on synthetic and real-world datasets.

Root Cause Analysis (RCA) aims at identifying the underlying causes of system faults by uncovering and analyzing the causal structure from complex systems. It has been widely used in many application domains. Reliable diagnostic conclusions are of great importance in mitigating system failures and financial losses. However, previous studies implicitly assume a full observation of the system, which neglect the effect of partial observation (i.e., missing nodes and latent malfunction). As a result, they fail in deriving reliable RCA results. In this paper, we unveil the issues of unobserved confounders and heterogeneity in partial observation and come up with a new problem of root cause analysis with partially observed data. To achieve this, we propose PORCA, a novel RCA framework which can explore reliable root causes under both unobserved confounders and unobserved heterogeneity. PORCA leverages magnified score-based causal discovery to efficiently optimize acyclic directed mixed graph under unobserved confounders. In addition, we also develop a heterogeneity-aware scheduling strategy to provide adaptive sample weights. Extensive experimental results on one synthetic and two real-world datasets demonstrate the effectiveness and superiority of the proposed framework.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes