Generalized alignment-based trace clustering of process behavior
Document type: Conference report
Rights access: Open Access
Process mining techniques use event logs containing real process executions in order to mine, align and extend process models. Partitioning an event log into trace variants facilitates the understanding and analysis of traces, so it is a common pre-processing step in process mining environments. Trace clustering automates this partition; traditionally, it has been applied without considering whether a process model is available. In this paper, we extend our previous work on process-model-based trace clustering by allowing cluster centroids to have a complex structure that can range from a partial order down to a subnet of the initial process model. In this way, the new clustering framework is able to cluster together traces that are distant only because of concurrency or loop constructs in process models. We present a complexity analysis of the different instantiations of the trace clustering framework and have implemented the framework in a prototype tool that has been tested on different datasets.
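To make the notions of trace variant and partial-order centroid concrete, the following is a minimal Python sketch, not the paper's algorithm. The toy log, the activity names, the centroid, and the helper `is_linearization` are all hypothetical. The sketch also simplifies the clustering criterion to an exact fit, whereas the paper works with alignment-based distances.

```python
from collections import Counter

# Hypothetical toy event log: each trace is a tuple of activity names.
log = [
    ("a", "b", "c", "d"),
    ("a", "c", "b", "d"),
    ("a", "b", "c", "d"),
    ("a", "d"),
]

# Trace variants: the distinct activity sequences and their frequencies.
variants = Counter(log)

# Hypothetical partial-order centroid: (x, y) means x must occur before y.
# Activities b and c are left unordered, i.e. they are concurrent.
centroid_events = {"a", "b", "c", "d"}
centroid_order = {("a", "b"), ("a", "c"), ("b", "d"), ("c", "d")}


def is_linearization(trace, events, order):
    """Return True if trace uses each event exactly once and respects order."""
    if len(trace) != len(events) or set(trace) != events:
        return False
    position = {activity: i for i, activity in enumerate(trace)}
    return all(position[x] < position[y] for x, y in order)


# Simplifying assumption: a variant joins the cluster only if it fits exactly.
clusters = {
    variant: is_linearization(variant, centroid_events, centroid_order)
    for variant in variants
}
```

In this toy log, the variants `a, b, c, d` and `a, c, b, d` differ as sequences but are both linearizations of the same partial order, so a partial-order centroid groups them together, while `a, d` stays outside the cluster.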
Citation: Boltenhagen, M.; Chatain, T.; Carmona, J. Generalized alignment-based trace clustering of process behavior. In: International Conference on Applications and Theory of Petri Nets and Concurrency. "Application and Theory of Petri Nets and Concurrency, 40th International Conference, PETRI NETS 2019: Aachen, Germany, June 23–28, 2019: proceedings". Berlin: Springer, 2019, p. 237-257.