Alternative approach to tree-structured web log representation and mining

More recent approaches to web log data representation aim to capture user navigational patterns with respect to the overall structure of the web site. One such representation is tree-structured log files, which are the focus of this work. Most existing methods for analyzing such data are based on...

Bibliographic Details
Main Authors: Hadzic, Fedja, Hecker, Michael
Other Authors: Mohand-Saïd Hacid
Format: Conference Paper
Published: IEEE Computer Society 2011
Subjects: tree-structured web logs; web usage mining
Online Access: http://hdl.handle.net/20.500.11937/25708
_version_ 1848751783101333504
author Hadzic, Fedja
Hecker, Michael
author2 Mohand-Saïd Hacid
author_facet Mohand-Saïd Hacid
Hadzic, Fedja
Hecker, Michael
author_sort Hadzic, Fedja
building Curtin Institutional Repository
collection Online Access
description More recent approaches to web log data representation aim to capture user navigational patterns with respect to the overall structure of the web site. One such representation is tree-structured log files, which are the focus of this work. Most existing methods for analyzing such data are based on the use of frequent subtree mining techniques to extract frequent user activity and navigational paths. In this paper we evaluate the use of other standard data mining techniques enabled by a recently proposed structure-preserving flat data representation for tree-structured data. The initially proposed framework was adjusted to better suit the web log mining task. Experimental evaluation is performed on two real-world web log datasets and comparisons are made with an existing state-of-the-art classifier for tree-structured data. The results show the great potential of the method in enabling the application of a wider range of data mining/analysis techniques to tree-structured web log data.
first_indexed 2025-11-14T07:58:12Z
format Conference Paper
id curtin-20.500.11937-25708
institution Curtin University Malaysia
institution_category Local University
last_indexed 2025-11-14T07:58:12Z
publishDate 2011
publisher IEEE Computer Society
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-25708 2023-01-27T05:26:33Z Alternative approach to tree-structured web log representation and mining Hadzic, Fedja Hecker, Michael Mohand-Saïd Hacid tree-structured web logs web usage mining More recent approaches to web log data representation aim to capture user navigational patterns with respect to the overall structure of the web site. One such representation is tree-structured log files, which are the focus of this work. Most existing methods for analyzing such data are based on the use of frequent subtree mining techniques to extract frequent user activity and navigational paths. In this paper we evaluate the use of other standard data mining techniques enabled by a recently proposed structure-preserving flat data representation for tree-structured data. The initially proposed framework was adjusted to better suit the web log mining task. Experimental evaluation is performed on two real-world web log datasets and comparisons are made with an existing state-of-the-art classifier for tree-structured data. The results show the great potential of the method in enabling the application of a wider range of data mining/analysis techniques to tree-structured web log data. 2011 Conference Paper http://hdl.handle.net/20.500.11937/25708 10.1109/WI-IAT.2011.156 IEEE Computer Society fulltext
spellingShingle tree-structured web logs
web usage mining
Hadzic, Fedja
Hecker, Michael
Alternative approach to tree-structured web log representation and mining
title Alternative approach to tree-structured web log representation and mining
title_full Alternative approach to tree-structured web log representation and mining
title_fullStr Alternative approach to tree-structured web log representation and mining
title_full_unstemmed Alternative approach to tree-structured web log representation and mining
title_short Alternative approach to tree-structured web log representation and mining
title_sort alternative approach to tree-structured web log representation and mining
topic tree-structured web logs
web usage mining
url http://hdl.handle.net/20.500.11937/25708