Self-adaptive semantic focused crawler for mining services information discovery

It is well recognized that the Internet has become the largest marketplace in the world, and online advertising is very popular with numerous industries, including the traditional mining service industry where mining service advertisements are effective carriers of mining service information. Howeve...

Full description

Bibliographic Details
Main Authors: Dong, Hai, Hussain, F.
Format: Journal Article
Published: IEEE Computer Society 2014
Online Access:http://hdl.handle.net/20.500.11937/47949
_version_ 1848757975046422528
author Dong, Hai
Hussain, F.
author_facet Dong, Hai
Hussain, F.
author_sort Dong, Hai
building Curtin Institutional Repository
collection Online Access
description It is well recognized that the Internet has become the largest marketplace in the world, and online advertising is very popular with numerous industries, including the traditional mining service industry where mining service advertisements are effective carriers of mining service information. However, service users may encounter three major issues - heterogeneity, ubiquity, and ambiguity, when searching for mining service information over the Internet. In this paper, we present the framework of a novel self-adaptive semantic focused crawler - SASF crawler, with the purpose of precisely and efficiently discovering, formatting, and indexing mining service information over the Internet, by taking into account the three major issues. This framework incorporates the technologies of semantic focused crawling and ontology learning, in order to maintain the performance of this crawler, regardless of the variety in the Web environment. The innovations of this research lie in the design of an unsupervised framework for vocabulary-based ontology learning, and a hybrid algorithm for matching semantically relevant concepts and metadata. A series of experiments are conducted in order to evaluate the performance of this crawler. The conclusion and the direction of future work are given in the final section. © 2012 IEEE.
first_indexed 2025-11-14T09:36:37Z
format Journal Article
id curtin-20.500.11937-47949
institution Curtin University Malaysia
institution_category Local University
last_indexed 2025-11-14T09:36:37Z
publishDate 2014
publisher IEEE Computer Society
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-479492017-09-13T14:18:22Z Self-adaptive semantic focused crawler for mining services information discovery Dong, Hai Hussain, F. It is well recognized that the Internet has become the largest marketplace in the world, and online advertising is very popular with numerous industries, including the traditional mining service industry where mining service advertisements are effective carriers of mining service information. However, service users may encounter three major issues - heterogeneity, ubiquity, and ambiguity, when searching for mining service information over the Internet. In this paper, we present the framework of a novel self-adaptive semantic focused crawler - SASF crawler, with the purpose of precisely and efficiently discovering, formatting, and indexing mining service information over the Internet, by taking into account the three major issues. This framework incorporates the technologies of semantic focused crawling and ontology learning, in order to maintain the performance of this crawler, regardless of the variety in the Web environment. The innovations of this research lie in the design of an unsupervised framework for vocabulary-based ontology learning, and a hybrid algorithm for matching semantically relevant concepts and metadata. A series of experiments are conducted in order to evaluate the performance of this crawler. The conclusion and the direction of future work are given in the final section. © 2012 IEEE. 2014 Journal Article http://hdl.handle.net/20.500.11937/47949 10.1109/TII.2012.2234472 IEEE Computer Society restricted
spellingShingle Dong, Hai
Hussain, F.
Self-adaptive semantic focused crawler for mining services information discovery
title Self-adaptive semantic focused crawler for mining services information discovery
title_full Self-adaptive semantic focused crawler for mining services information discovery
title_fullStr Self-adaptive semantic focused crawler for mining services information discovery
title_full_unstemmed Self-adaptive semantic focused crawler for mining services information discovery
title_short Self-adaptive semantic focused crawler for mining services information discovery
title_sort self-adaptive semantic focused crawler for mining services information discovery
url http://hdl.handle.net/20.500.11937/47949