DBAICRLGSep 17, 2021

SaCoFa: Semantics-aware Control-flow Anonymization for Process Mining

arXiv:2109.08501v119 citations
Originality Incremental advance
AI Analysis

This addresses privacy concerns in business process analysis for organizations, though it is incremental as it builds on differential privacy methods.

The paper tackles the problem of privacy-preserving process mining by introducing semantics-aware anonymization to avoid generating unrealistic traces, resulting in event logs with significantly higher utility than existing approaches.

Privacy-preserving process mining enables the analysis of business processes using event logs, while giving guarantees on the protection of sensitive information on process stakeholders. To this end, existing approaches add noise to the results of queries that extract properties of an event log, such as the frequency distribution of trace variants, for analysis.Noise insertion neglects the semantics of the process, though, and may generate traces not present in the original log. This is problematic. It lowers the utility of the published data and makes noise easily identifiable, as some traces will violate well-known semantic constraints.In this paper, we therefore argue for privacy preservation that incorporates a process semantics. For common trace-variant queries, we show how, based on the exponential mechanism, semantic constraints are incorporated to ensure differential privacy of the query result. Experiments demonstrate that our semantics-aware anonymization yields event logs of significantly higher utility than existing approaches.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes