A framework for application of tree-structured data mining to process log analysis

Many data mining and simulation based algorithms have been applied in the process mining field; nevertheless they mainly focus on the process discovery and conformance checking tasks. Even though the event logs are increasingly represented in semi-structured format using XML-based templates, commonl...

Full description

Bibliographic Details
Main Authors: Bui, Dang, Hadzic, Fedja, Potdar, Vidyasagar
Other Authors: Hujin, Y.
Format: Conference Paper
Published: Springer Verlag 2012
Online Access:http://hdl.handle.net/20.500.11937/19696
Description
Summary:Many data mining and simulation based algorithms have been applied in the process mining field; nevertheless they mainly focus on the process discovery and conformance checking tasks. Even though the event logs are increasingly represented in semi-structured format using XML-based templates, commonly used XML mining techniques have not been explored. In this paper, we investigate the application of tree mining techniques and propose a general framework, within which a wider range of structure aware data mining techniques can be applied. Decision tree learning and frequent pattern mining are used as a case in point in the experiments on publicly available real dataset. The results indicate the promising properties of the proposed framework in adding to the available set of tools for process log analysis by enabling (i) direct data mining of tree-structured process logs (ii) extraction of informative knowledge patterns and (iii) frequent pattern mining at lower minimum support thresholds.