Leveraging Duration Pseudo-Embeddings in Multilevel LSTM and GCN Hypermodels for Outcome-Oriented PPM
This work addresses limitations in deep learning models for Predictive Process Monitoring, offering a flexible design for robust, real-world applications, though it appears incremental as it builds on existing LSTM and GCN baselines.
The paper tackled the problem of temporal irregularities in Predictive Process Monitoring by proposing a dual input neural network strategy with duration-aware pseudo-embeddings, resulting in improved generalization, reduced model complexity, and enhanced interpretability across balanced and imbalanced outcome prediction tasks.
Existing deep learning models for Predictive Process Monitoring (PPM) struggle with temporal irregularities, particularly stochastic event durations and overlapping timestamps, limiting their adaptability across heterogeneous datasets. We propose a dual input neural network strategy that separates event and sequence attributes, using a duration-aware pseudo-embedding matrix to transform temporal importance into compact, learnable representations. This design is implemented across two baseline families: B-LSTM and B-GCN, and their duration-aware variants D-LSTM and D-GCN. All models incorporate self-tuned hypermodels for adaptive architecture selection. Experiments on balanced and imbalanced outcome prediction tasks show that duration pseudo-embedding inputs consistently improve generalization, reduce model complexity, and enhance interpretability. Our results demonstrate the benefits of explicit temporal encoding and provide a flexible design for robust, real-world PPM applications.