LGSep 21, 2025

On the Simplification of Neural Network Architectures for Predictive Process Monitoring

arXiv:2509.17145v1h-index: 11
Originality Incremental advance
AI Analysis

This work addresses efficiency for practitioners in process monitoring, but it is incremental as it builds on existing methods with architectural simplifications.

The paper tackles the high computational cost of deep learning models in Predictive Process Monitoring by analyzing the impact of simplifying model architectures, showing that reducing a Transformer model by 85% leads to only a 2-3% performance drop across tasks.

Predictive Process Monitoring (PPM) aims to forecast the future behavior of ongoing process instances using historical event data, enabling proactive decision-making. While recent advances rely heavily on deep learning models such as LSTMs and Transformers, their high computational cost hinders practical adoption. Prior work has explored data reduction techniques and alternative feature encodings, but the effect of simplifying model architectures themselves remains underexplored. In this paper, we analyze how reducing model complexity, both in terms of parameter count and architectural depth, impacts predictive performance, using two established PPM approaches. Across five diverse event logs, we show that shrinking the Transformer model by 85% results in only a 2-3% drop in performance across various PPM tasks, while the LSTM proves slightly more sensitive, particularly for waiting time prediction. Overall, our findings suggest that substantial model simplification can preserve predictive accuracy, paving the way for more efficient and scalable PPM solutions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes