Skip to Main content Skip to Navigation
Conference papers

DB-XES: Enabling Process Discovery in the Large

Abstract : Dealing with the abundance of event data is one of the main process discovery challenges. Current process discovery techniques are able to efficiently handle imported event log files that fit in the computer’s memory. Once data files get bigger, scalability quickly drops since the speed required to access the data becomes a limiting factor. This paper proposes a new technique based on relational database technology as a solution for scalable process discovery. A relational database is used both for storing event data (i.e. we move the location of the data) and for pre-processing the event data (i.e. we move some computations from analysis-time to insertion-time). To this end, we first introduce DB-XES as a database schema which resembles the standard XES structure, we provide a transparent way to access event data stored in DB-XES, and we show how this greatly improves on the memory requirements of the state-of-the-art process discovery techniques. Secondly, we show how to move the computation of intermediate data structures to the database engine, to reduce the time required during process discovery. The work presented in this paper is implemented in ProM tool, and a range of experiments demonstrates the feasibility of our approach.
Document type :
Conference papers
Complete list of metadata

Cited literature [29 references]  Display  Hide  Download

https://hal.inria.fr/hal-01769759
Contributor : Hal Ifip <>
Submitted on : Wednesday, April 18, 2018 - 1:01:33 PM
Last modification on : Wednesday, April 18, 2018 - 2:52:09 PM

File

463443_1_En_4_Chapter.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Alifah Syamsiyah, Boudewijn Dongen, Wil Aalst. DB-XES: Enabling Process Discovery in the Large. 6th International Symposium on Data-Driven Process Discovery and Analysis (SIMPDA), Dec 2016, Graz, Austria. pp.53-77, ⟨10.1007/978-3-319-74161-1_4⟩. ⟨hal-01769759⟩

Share

Metrics

Record views

161

Files downloads

4