A semantic crawler based on an extended CBR algorithm

A semantic (web) crawler refers to a series of web crawlers designed for harvesting semantic web content. This paper presents the framework of a semantic crawler that can abstract metadata from online webpages and cluster the metadata by associating them with ontological concepts. The clustering is...

Full description

Bibliographic Details
Main Authors: Dong, Hai, Hussain, Farookh Khadeer, Chang, Elizabeth
Other Authors: R. Meersman
Format: Conference Paper
Published: Springer 2008
Subjects:
Online Access:http://hdl.handle.net/20.500.11937/36321
_version_ 1848754736273031168
author Dong, Hai
Hussain, Farookh Khadeer
Chang, Elizabeth
author2 R. Meersman
author_facet R. Meersman
Dong, Hai
Hussain, Farookh Khadeer
Chang, Elizabeth
author_sort Dong, Hai
building Curtin Institutional Repository
collection Online Access
description A semantic (web) crawler refers to a series of web crawlers designed for harvesting semantic web content. This paper presents the framework of a semantic crawler that can abstract metadata from online webpages and cluster the metadata by associating them with ontological concepts. The clustering is based on a CBR algorithm which is adopted in the field of problem solving. We reveal the technical details with regard to ontological concept and metadata format, and the extended CBR algorithm. In addition, the system implementation and evaluation details are provided in detail, finalized by our conclusion and further works.
first_indexed 2025-11-14T08:45:09Z
format Conference Paper
id curtin-20.500.11937-36321
institution Curtin University Malaysia
institution_category Local University
last_indexed 2025-11-14T08:45:09Z
publishDate 2008
publisher Springer
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-363212022-12-07T06:50:49Z A semantic crawler based on an extended CBR algorithm Dong, Hai Hussain, Farookh Khadeer Chang, Elizabeth R. Meersman Z. Tari P. Herrero extended CBR algorithm ontological concepts metadata abstraction semantic crawler A semantic (web) crawler refers to a series of web crawlers designed for harvesting semantic web content. This paper presents the framework of a semantic crawler that can abstract metadata from online webpages and cluster the metadata by associating them with ontological concepts. The clustering is based on a CBR algorithm which is adopted in the field of problem solving. We reveal the technical details with regard to ontological concept and metadata format, and the extended CBR algorithm. In addition, the system implementation and evaluation details are provided in detail, finalized by our conclusion and further works. 2008 Conference Paper http://hdl.handle.net/20.500.11937/36321 10.1007/978-3-540-88875-8_135 Springer fulltext
spellingShingle extended CBR algorithm
ontological concepts
metadata abstraction
semantic crawler
Dong, Hai
Hussain, Farookh Khadeer
Chang, Elizabeth
A semantic crawler based on an extended CBR algorithm
title A semantic crawler based on an extended CBR algorithm
title_full A semantic crawler based on an extended CBR algorithm
title_fullStr A semantic crawler based on an extended CBR algorithm
title_full_unstemmed A semantic crawler based on an extended CBR algorithm
title_short A semantic crawler based on an extended CBR algorithm
title_sort semantic crawler based on an extended cbr algorithm
topic extended CBR algorithm
ontological concepts
metadata abstraction
semantic crawler
url http://hdl.handle.net/20.500.11937/36321