U3 - mining unordered embedded subtrees using TMG candidate generation

In this paper we present an algorithm for mining of unordered embedded subtrees. This is an importantproblem for association rule mining from semistructured documents, and it has important applications in many biomedical, web and scientific domains. The proposed U3 algorithm is an extension of our g...

Full description

Bibliographic Details
Main Authors: Hadzic, Fedja, Tan, Henry, Dillon, Tharam S.
Other Authors: Chengqi Zhang
Format: Conference Paper
Published: Institute of Electrical and Electronics Engineers (IEEE) Computer Society 2008
Online Access:http://hdl.handle.net/20.500.11937/44704
Description
Summary:In this paper we present an algorithm for mining of unordered embedded subtrees. This is an importantproblem for association rule mining from semistructured documents, and it has important applications in many biomedical, web and scientific domains. The proposed U3 algorithm is an extension of our general tree model guided (TMG) candidate generation framework and it considers both transaction based and occurrence match support. Synthetic and real world data sets are used to experimentally demonstrate the efficiency of our approach to the problem, and the flexibility of our general TMG framework.