LGJun 28, 2024

CHASE: A Causal Hypergraph based Framework for Root Cause Analysis in Multimodal Microservice Systems

arXiv:2406.19711v25 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of quickly locating anomalies in enterprise microservice systems to improve availability and maintenance, representing a domain-specific incremental advancement.

The paper tackles the problem of root cause analysis in complex microservice systems by proposing CHASE, a framework that uses causal hypergraphs with multimodal data, achieving performance gains of up to 36.2% and 29.4% compared to state-of-the-art methods.

In recent years, the widespread adoption of distributed microservice architectures within the industry has significantly increased the demand for enhanced system availability and robustness. Due to the complex service invocation paths and dependencies in enterprise-level microservice systems, it is challenging to locate the anomalies promptly during service invocations, thus causing intractable issues for normal system operations and maintenance. In this paper, we propose a Causal Heterogeneous grAph baSed framEwork for root cause analysis, namely CHASE, for microservice systems with multimodal data, including traces, logs, and system monitoring metrics. Specifically, related information is encoded into representative embeddings and further modeled by a multimodal invocation graph. Following that, anomaly detection is performed on each instance node with attentive heterogeneous message passing from its adjacent metric and log nodes. Finally, CHASE learns from the constructed hypergraph with hyperedges representing the flow of causality and performs root cause localization. We evaluate the proposed framework on two public microservice datasets with distinct attributes and compare with the state-of-the-art methods. The results show that CHASE achieves the average performance gain up to 36.2%(A@1) and 29.4%(Percentage@1), respectively to its best counterpart.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes