A semantic crawler based on an extended CBR algorithm
A semantic (web) crawler refers to a series of web crawlers designed for harvesting semantic web content. This paper presents the framework of a semantic crawler that can abstract metadata from online webpages and cluster the metadata by associating them with ontological concepts. The clustering is...
| Main Authors: | , , |
|---|---|
| Other Authors: | |
| Format: | Conference Paper |
| Published: |
Springer
2008
|
| Subjects: | |
| Online Access: | http://hdl.handle.net/20.500.11937/36321 |
| _version_ | 1848754736273031168 |
|---|---|
| author | Dong, Hai Hussain, Farookh Khadeer Chang, Elizabeth |
| author2 | R. Meersman |
| author_facet | R. Meersman Dong, Hai Hussain, Farookh Khadeer Chang, Elizabeth |
| author_sort | Dong, Hai |
| building | Curtin Institutional Repository |
| collection | Online Access |
| description | A semantic (web) crawler refers to a series of web crawlers designed for harvesting semantic web content. This paper presents the framework of a semantic crawler that can abstract metadata from online webpages and cluster the metadata by associating them with ontological concepts. The clustering is based on a CBR algorithm which is adopted in the field of problem solving. We reveal the technical details with regard to ontological concept and metadata format, and the extended CBR algorithm. In addition, the system implementation and evaluation details are provided in detail, finalized by our conclusion and further works. |
| first_indexed | 2025-11-14T08:45:09Z |
| format | Conference Paper |
| id | curtin-20.500.11937-36321 |
| institution | Curtin University Malaysia |
| institution_category | Local University |
| last_indexed | 2025-11-14T08:45:09Z |
| publishDate | 2008 |
| publisher | Springer |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | curtin-20.500.11937-363212022-12-07T06:50:49Z A semantic crawler based on an extended CBR algorithm Dong, Hai Hussain, Farookh Khadeer Chang, Elizabeth R. Meersman Z. Tari P. Herrero extended CBR algorithm ontological concepts metadata abstraction semantic crawler A semantic (web) crawler refers to a series of web crawlers designed for harvesting semantic web content. This paper presents the framework of a semantic crawler that can abstract metadata from online webpages and cluster the metadata by associating them with ontological concepts. The clustering is based on a CBR algorithm which is adopted in the field of problem solving. We reveal the technical details with regard to ontological concept and metadata format, and the extended CBR algorithm. In addition, the system implementation and evaluation details are provided in detail, finalized by our conclusion and further works. 2008 Conference Paper http://hdl.handle.net/20.500.11937/36321 10.1007/978-3-540-88875-8_135 Springer fulltext |
| spellingShingle | extended CBR algorithm ontological concepts metadata abstraction semantic crawler Dong, Hai Hussain, Farookh Khadeer Chang, Elizabeth A semantic crawler based on an extended CBR algorithm |
| title | A semantic crawler based on an extended CBR algorithm |
| title_full | A semantic crawler based on an extended CBR algorithm |
| title_fullStr | A semantic crawler based on an extended CBR algorithm |
| title_full_unstemmed | A semantic crawler based on an extended CBR algorithm |
| title_short | A semantic crawler based on an extended CBR algorithm |
| title_sort | semantic crawler based on an extended cbr algorithm |
| topic | extended CBR algorithm ontological concepts metadata abstraction semantic crawler |
| url | http://hdl.handle.net/20.500.11937/36321 |