AILGNov 12, 2022

A Pipeline for Business Intelligence and Data-Driven Root Cause Analysis on Categorical Data

arXiv:2211.06717v1h-index: 16
Originality Synthesis-oriented
AI Analysis

This work addresses business intelligence and root cause analysis for businesses and data scientists, but it is incremental as it combines existing methods.

The paper tackles the problem of extracting business intelligence and performing root cause analysis from categorical data by proposing a new pipeline combining clustering and association rule mining, resulting in association rules with metrics to explain events and support decisions.

Business intelligence (BI) is any knowledge derived from existing data that may be strategically applied within a business. Data mining is a technique or method for extracting BI from data using statistical data modeling. Finding relationships or correlations between the various data items that have been collected can be used to boost business performance or at the very least better comprehend what is going on. Root cause analysis (RCA) is discovering the root causes of problems or events to identify appropriate solutions. RCA can show why an event occurred and this can help in avoiding occurrences of an issue in the future. This paper proposes a new clustering + association rule mining pipeline for getting business insights from data. The results of this pipeline are in the form of association rules having consequents, antecedents, and various metrics to evaluate these rules. The results of this pipeline can help in anchoring important business decisions and can also be used by data scientists for updating existing models or while developing new ones. The occurrence of any event is explained by its antecedents in the generated rules. Hence this output can also help in data-driven root cause analysis.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes