CLCYLGAug 25, 2025

The ProLiFIC dataset: Leveraging LLMs to Unveil the Italian Lawmaking Process

arXiv:2509.03528v1
Originality Synthesis-oriented
AI Analysis

This work addresses a data bottleneck for researchers applying Process Mining to legal systems, specifically in Italy, though it is incremental as it builds on existing efforts to integrate LLMs with Process Mining.

The authors tackled the limited accessibility and quality of datasets for Process Mining in the legal domain by introducing ProLiFIC, a comprehensive event log of the Italian lawmaking process from 1987 to 2022, created using LLMs from unstructured data, and proposed it as a benchmark for legal Process Mining.

Process Mining (PM), initially developed for industrial and business contexts, has recently been applied to social systems, including legal ones. However, PM's efficacy in the legal domain is limited by the accessibility and quality of datasets. We introduce ProLiFIC (Procedural Lawmaking Flow in Italian Chambers), a comprehensive event log of the Italian lawmaking process from 1987 to 2022. Created from unstructured data from the Normattiva portal and structured using large language models (LLMs), ProLiFIC aligns with recent efforts in integrating PM with LLMs. We exemplify preliminary analyses and propose ProLiFIC as a benchmark for legal PM, fostering new developments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes