Process-Aware Information Systems (PAIS) are extensively employed to support organizational workflows, with configurations that often differ across various usage contexts. Analyzing the event logs they generate is essential for understanding this variability; however, traditional process mining techniques often face scalability challenges, particularly when dealing with loops and a large number of process instances. This paper introduces ReACMe, a parametric, unsupervised clustering methodology that bypasses model generation by leveraging n-gram-based features and a repetition-aware dissimilarity measure. Using the k-medoids algorithm, ReACMe effectively groups similar logs and allows to identify representative medoids. The approach is validated on both public datasets and a real-world e-government scenario, demonstrating its efficiency and practical applicability.
ReACMe: Repetition Aware Clustering Methodology for Business Process Log Collections
Luciani, Caterina;Bucchicchio, Luigi;Morichetta, Andrea;Piangerelli, Marco;Polini, Andrea
2025-01-01
Abstract
Process-Aware Information Systems (PAIS) are extensively employed to support organizational workflows, with configurations that often differ across various usage contexts. Analyzing the event logs they generate is essential for understanding this variability; however, traditional process mining techniques often face scalability challenges, particularly when dealing with loops and a large number of process instances. This paper introduces ReACMe, a parametric, unsupervised clustering methodology that bypasses model generation by leveraging n-gram-based features and a repetition-aware dissimilarity measure. Using the k-medoids algorithm, ReACMe effectively groups similar logs and allows to identify representative medoids. The approach is validated on both public datasets and a real-world e-government scenario, demonstrating its efficiency and practical applicability.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


