SEJun 3, 2015

Clustering-Based Predictive Process Monitoring

arXiv:1506.01428v1165 citations
Originality Synthesis-oriented
AI Analysis

This addresses predictive process monitoring for business analysts, but it is incremental as it builds on existing clustering and classification methods.

The paper tackles the problem of predicting whether a given predicate will be fulfilled in running business process cases by proposing a clustering-based framework that uses event logs, achieving validation on a real-world hospital cancer treatment log.

Business process enactment is generally supported by information systems that record data about process executions, which can be extracted as event logs. Predictive process monitoring is concerned with exploiting such event logs to predict how running (uncompleted) cases will unfold up to their completion. In this paper, we propose a predictive process monitoring framework for estimating the probability that a given predicate will be fulfilled upon completion of a running case. The predicate can be, for example, a temporal logic constraint or a time constraint, or any predicate that can be evaluated over a completed trace. The framework takes into account both the sequence of events observed in the current trace, as well as data attributes associated to these events. The prediction problem is approached in two phases. First, prefixes of previous traces are clustered according to control flow information. Secondly, a classifier is built for each cluster using event data to discriminate between fulfillments and violations. At runtime, a prediction is made on a running case by mapping it to a cluster and applying the corresponding classifier. The framework has been implemented in the ProM toolset and validated on a log pertaining to the treatment of cancer patients in a large hospital.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes