Alternative approach to tree-structured web log representation and mining
More recent approaches to web log data representation aim to capture the user navigational patterns with respect to the overall structure of the web site. One such representation is tree-structured log files which is the focus of this work. Most existing methods for analyzing such data are based on...
| Main Authors: | , |
|---|---|
| Other Authors: | |
| Format: | Conference Paper |
| Published: |
IEEE Computer Society
2011
|
| Subjects: | |
| Online Access: | http://hdl.handle.net/20.500.11937/25708 |
| _version_ | 1848751783101333504 |
|---|---|
| author | Hadzic, Fedja Hecker, Michael |
| author2 | Mohand-Saïd Hacid |
| author_facet | Mohand-Saïd Hacid Hadzic, Fedja Hecker, Michael |
| author_sort | Hadzic, Fedja |
| building | Curtin Institutional Repository |
| collection | Online Access |
| description | More recent approaches to web log data representation aim to capture the user navigational patterns with respect to the overall structure of the web site. One such representation is tree-structured log files which is the focus of this work. Most existing methods for analyzing such data are based on the use of frequent subtree mining techniques to extract frequent user activity and navigational paths. In this paper we evaluate the use of other standard data mining techniques enabled by a recently proposed structure preserving flat data representation for tree-structured data. The initially proposed framework was adjusted to better suit the web log mining task. Experimental evaluation is performed on two real world web log datasets and comparisons are made with an existing state-of-the art classifier for tree-structured data. The results show the great potential of the method in enabling the application of a wider range of data mining/analysis techniques to tree-structured web log data. |
| first_indexed | 2025-11-14T07:58:12Z |
| format | Conference Paper |
| id | curtin-20.500.11937-25708 |
| institution | Curtin University Malaysia |
| institution_category | Local University |
| last_indexed | 2025-11-14T07:58:12Z |
| publishDate | 2011 |
| publisher | IEEE Computer Society |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | curtin-20.500.11937-257082023-01-27T05:26:33Z Alternative approach to tree-structured web log representation and mining Hadzic, Fedja Hecker, Michael Mohand-Saïd Hacid tree-structured web logs web usage mining More recent approaches to web log data representation aim to capture the user navigational patterns with respect to the overall structure of the web site. One such representation is tree-structured log files which is the focus of this work. Most existing methods for analyzing such data are based on the use of frequent subtree mining techniques to extract frequent user activity and navigational paths. In this paper we evaluate the use of other standard data mining techniques enabled by a recently proposed structure preserving flat data representation for tree-structured data. The initially proposed framework was adjusted to better suit the web log mining task. Experimental evaluation is performed on two real world web log datasets and comparisons are made with an existing state-of-the art classifier for tree-structured data. The results show the great potential of the method in enabling the application of a wider range of data mining/analysis techniques to tree-structured web log data. 2011 Conference Paper http://hdl.handle.net/20.500.11937/25708 10.1109/WI-IAT.2011.156 IEEE Computer Society fulltext |
| spellingShingle | tree-structured web logs web usage mining Hadzic, Fedja Hecker, Michael Alternative approach to tree-structured web log representation and mining |
| title | Alternative approach to tree-structured web log representation and mining |
| title_full | Alternative approach to tree-structured web log representation and mining |
| title_fullStr | Alternative approach to tree-structured web log representation and mining |
| title_full_unstemmed | Alternative approach to tree-structured web log representation and mining |
| title_short | Alternative approach to tree-structured web log representation and mining |
| title_sort | alternative approach to tree-structured web log representation and mining |
| topic | tree-structured web logs web usage mining |
| url | http://hdl.handle.net/20.500.11937/25708 |