CLAISep 3, 2025

Domain Adaptation of LLMs for Process Data

arXiv:2509.03161v11 citationsh-index: 31
Originality Incremental advance
AI Analysis

This addresses the problem of computational efficiency and performance in process mining for researchers and practitioners, though it is incremental as it builds on existing fine-tuning techniques.

The study tackled adapting pretrained large language models directly to process data for predictive process monitoring, achieving improved predictive performance over state-of-the-art RNN and narrative-based methods, with faster convergence and less hyperparameter tuning.

In recent years, Large Language Models (LLMs) have emerged as a prominent area of interest across various research domains, including Process Mining (PM). Current applications in PM have predominantly centered on prompt engineering strategies or the transformation of event logs into narrative-style datasets, thereby exploiting the semantic capabilities of LLMs to address diverse tasks. In contrast, this study investigates the direct adaptation of pretrained LLMs to process data without natural language reformulation, motivated by the fact that these models excel in generating sequences of tokens, similar to the objective in PM. More specifically, we focus on parameter-efficient fine-tuning techniques to mitigate the computational overhead typically associated with such models. Our experimental setup focuses on Predictive Process Monitoring (PPM), and considers both single- and multi-task predictions. The results demonstrate a potential improvement in predictive performance over state-of-the-art recurrent neural network (RNN) approaches and recent narrative-style-based solutions, particularly in the multi-task setting. Additionally, our fine-tuned models exhibit faster convergence and require significantly less hyperparameter optimization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes