Exploiting Event Log Event Attributes in RNN Based Prediction
This work addresses a specific bottleneck in process mining for practitioners, offering an incremental improvement in handling event attributes.
The paper tackles the problem of underutilized event attributes in RNN-based predictive process analytics by introducing a novel clustering technique that balances prediction accuracy with training and prediction time, achieving improved accuracy in some cases at the cost of additional time.
In predictive process analytics, current and historical process data in event logs is used to predict the future, e.g., to predict the next activity or how long a process will still require to complete. Recurrent neural networks (RNN) and its subclasses have been demonstrated to be well suited for creating prediction models. Thus far, event attributes have not been fully utilized in these models. The biggest challenge in exploiting them in prediction models is the potentially large amount of event attributes and attribute values. We present a novel clustering technique that allows for trade-offs between prediction accuracy and the time needed for model training and prediction. As an additional finding, we also find that this clustering method combined with having raw event attribute values in some cases provides even better prediction accuracy at the cost of additional time required for training and prediction.