Alignment-based trace clustering

Cita com:
hdl:2117/127952
Document typeConference report
Defense date2017
PublisherSpringer
Rights accessOpen Access
Abstract
A novel method to cluster event log traces is presented in this paper. In contrast to the approaches in the literature, the clustering approach of this paper assumes an additional input: a process model that describes the current process. The core idea of the algorithm is to use model traces as centroids of the clusters detected, computed from a generalization of the notion of alignment. This way, model explanations of observed behavior are the driving force to compute the clusters, instead of current model agnostic approaches, e.g., which group log traces merely on their vector-space similarity. We believe alignment-based trace clustering provides results more useful for stakeholders. Moreover, in case of log incompleteness, noisy logs or concept drift, they can be more robust for dealing with highly deviating traces. The technique of this paper can be combined with any clustering technique to provide model explanations to the clusters computed. The proposed technique relies on encoding the individual alignment problems into the (pseudo-)Boolean domain, and has been implemented in our tool DarkSider that uses an open-source solver.
CitationChatain, T.; Carmona, J.; Dongen, B. Alignment-based trace clustering. A: International Conference on Conceptual Modeling. "Conceptual Modeling, 36th International Conference, ER 2017: Valencia, Spain, November 6-9, 2017: proceedings". Berlín: Springer, 2017, p. 295-308.
ISBN978-3-319-69904-2
Publisher versionhttps://link.springer.com/chapter/10.1007/978-3-319-69904-2_24
Files | Description | Size | Format | View |
---|---|---|---|---|
Alignment-Based Trace Clustering.pdf | 393,3Kb | View/Open |
All rights reserved. This work is protected by the corresponding intellectual and industrial
property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public
communication or transformation of this work are prohibited without permission of the copyright holder