Ontology-based Data Access for Extracting Event Logs from Legacy Data: The onprom Tool and Methodology

Diego Calvanese, Tahir Emre Kalayci, Marco Montali, and Stefano Tinella

Proc. of 20th Int. Conf. on Business Information Systems (BIS 2017). Volume 288 of Lecture Notes in Business Information Processing. 2017.

Process mining aims at discovering, monitoring, and improving business processes by extracting knowledge from event logs. In this respect, process mining can be applied only if there are proper event logs that are compatible with accepted standards, such as extensible event stream (XES). Unfortunately, in many real world set-ups, such event logs are not explicitly given, but instead are implicitly represented in legacy information systems. In this work, we exploit a framework and associated methodology for the extraction of XES event logs from relational data sources that we have recently introduced. Our approach is based on describing logs by means of suitable annotations of a conceptual model of the available data, and builds on the ontology-based data access (OBDA) paradigm for the actual log extraction. Making use of a real-world case study in the services domain, we compare our novel approach with a more traditional extract-transform-load based one, and are able to illustrate its added value. We also present a set of tools that we have developed and that support the OBDA-based log extraction framework. The tools are integrated as plugins of the ProM process mining suite.


@inproceedings{BIS-2017,
   title = "Ontology-based Data Access for Extracting Event Logs from Legacy
Data:  The onprom Tool and Methodology",
   year = "2017",
   author = "Diego Calvanese and Tahir Emre Kalayci and Marco Montali and
Stefano Tinella",
   booktitle = "Proc. of 20th Int. Conf. on Business Information Systems
(BIS 2017)",
   pages = "220--236",
   volume = "288",
   publisher = "Springer",
   series = "Lecture Notes in Business Information Processing",
   doi = "10.1007/978-3-319-59336-4  16",
}
pdf url