Skip to Main content Skip to Navigation
Conference papers

Prototype Selection using Clustering and Conformance Metrics for Process Discovery

Sani Mohammadreza Fani 1 Mathilde Boltenhagen 2 Aalst Wil van Der 1
2 MEXICO - Modeling and Exploitation of Interaction and Concurrency
Inria Saclay - Ile de France, LSV - Laboratoire Spécification et Vérification
Abstract : Automated process discovery algorithms aim to automatically create process models based on event data that is captured during the execution of business processes. These algorithms usually tend to use all of the event data to discover a process model. Using all (i.e., less common) behavior may lead to discover imprecise and/or complex process models that may conceal important information of processes. In this paper, we introduce a new incremental prototype selection algorithm based on the clustering of process instances to address this problem. The method iteratively computes a unique process model from a different set of selected prototypes that are representative of whole event data and stops when conformance metrics decrease. This method has been implemented using both ProM and RapidProM. We applied the proposed method on several real event datasets with state-of-the-art process discovery algorithms. Results show that using the proposed method leads to improve the general quality of discovered process models.
Document type :
Conference papers
Complete list of metadata
Contributor : Mathilde Boltenhagen Connect in order to contact the contributor
Submitted on : Monday, February 8, 2021 - 9:40:45 AM
Last modification on : Friday, January 21, 2022 - 3:12:00 AM
Long-term archiving on: : Sunday, May 9, 2021 - 6:18:24 PM


Files produced by the author(s)


  • HAL Id : hal-03134093, version 1


Sani Mohammadreza Fani, Mathilde Boltenhagen, Aalst Wil van Der. Prototype Selection using Clustering and Conformance Metrics for Process Discovery. BPI’20 - 16th International Workshop on Business Process Intelligence, Sep 2020, Sevilla, Spain. ⟨hal-03134093⟩



Les métriques sont temporairement indisponibles