Automated Calculation of Term Relatedness Weights for Semantic Searches

Information retrieval - finding and retrieving relevant sources of data, such as documents or geospatially located records - is a bottleneck in the process of accessing online data. Metadata describing data sources is variable in quality and quantity, textual descriptions are defined by data provide...

Full description

Bibliographic Details
Main Authors: Gulland, Elizabeth-Kate, Moncrieff, Simon, West, Geoff
Format: Conference Paper
Published: IEEE 2015
Online Access:http://hdl.handle.net/20.500.11937/61177
_version_ 1848760665360039936
author Gulland, Elizabeth-Kate
Moncrieff, Simon
West, Geoff
author_facet Gulland, Elizabeth-Kate
Moncrieff, Simon
West, Geoff
author_sort Gulland, Elizabeth-Kate
building Curtin Institutional Repository
collection Online Access
description Information retrieval - finding and retrieving relevant sources of data, such as documents or geospatially located records - is a bottleneck in the process of accessing online data. Metadata describing data sources is variable in quality and quantity, textual descriptions are defined by data providers and the terminology they use will not always match search terms, particularly in fields with specialised terminology, such as health. Augmenting the original query with related terms increases the likelihood of matching to relevant metadata. Related terms can be extracted from thesaurus and term definition resources or from the Semantic Web, which defines resources and relationships between them. However, relationships between terms are complicated by multiple interpretations, often dependent upon context (for example, 'sign' may mean a 'road sign' or a 'medical sign', such as fever). Including the strength and/or context of a relationship in a semantic link could help narrow down extra terms to those most relevant to the query. In this paper, methods for automatically calculating the relative strength of relationships between terms were investigated and compared for general and domain-specific terms. Calculations were based on a variety of textual resources including public, crowd-sourced online sources Wikipedia and Google search engine. Measures for term relatedness in a specialist domain were tested using health as a case study. Results show promise for automatic calculation of weights between terms, which can be used to develop weighted graphs for use in semantic searches.
first_indexed 2025-11-14T10:19:23Z
format Conference Paper
id curtin-20.500.11937-61177
institution Curtin University Malaysia
institution_category Local University
last_indexed 2025-11-14T10:19:23Z
publishDate 2015
publisher IEEE
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-611772018-02-06T00:43:49Z Automated Calculation of Term Relatedness Weights for Semantic Searches Gulland, Elizabeth-Kate Moncrieff, Simon West, Geoff Information retrieval - finding and retrieving relevant sources of data, such as documents or geospatially located records - is a bottleneck in the process of accessing online data. Metadata describing data sources is variable in quality and quantity, textual descriptions are defined by data providers and the terminology they use will not always match search terms, particularly in fields with specialised terminology, such as health. Augmenting the original query with related terms increases the likelihood of matching to relevant metadata. Related terms can be extracted from thesaurus and term definition resources or from the Semantic Web, which defines resources and relationships between them. However, relationships between terms are complicated by multiple interpretations, often dependent upon context (for example, 'sign' may mean a 'road sign' or a 'medical sign', such as fever). Including the strength and/or context of a relationship in a semantic link could help narrow down extra terms to those most relevant to the query. In this paper, methods for automatically calculating the relative strength of relationships between terms were investigated and compared for general and domain-specific terms. Calculations were based on a variety of textual resources including public, crowd-sourced online sources Wikipedia and Google search engine. Measures for term relatedness in a specialist domain were tested using health as a case study. Results show promise for automatic calculation of weights between terms, which can be used to develop weighted graphs for use in semantic searches. 2015 Conference Paper http://hdl.handle.net/20.500.11937/61177 10.1109/WI-IAT.2015.53 IEEE fulltext
spellingShingle Gulland, Elizabeth-Kate
Moncrieff, Simon
West, Geoff
Automated Calculation of Term Relatedness Weights for Semantic Searches
title Automated Calculation of Term Relatedness Weights for Semantic Searches
title_full Automated Calculation of Term Relatedness Weights for Semantic Searches
title_fullStr Automated Calculation of Term Relatedness Weights for Semantic Searches
title_full_unstemmed Automated Calculation of Term Relatedness Weights for Semantic Searches
title_short Automated Calculation of Term Relatedness Weights for Semantic Searches
title_sort automated calculation of term relatedness weights for semantic searches
url http://hdl.handle.net/20.500.11937/61177