Keyword Competition Approach In Ranked Document Retrieval

due to the availability of huge storage spaces, multiple storage devices and different storage media. The rapid growth of data in the database will eventually render the data unmanageable and cause problems in retrieval, where the users are unable to retrieve the right document. This is one of...

Full description

Bibliographic Details
Main Author:	Sihombing, Poltak
Format:	Thesis
Language:	English
Published:	2010
Subjects:	QA75.5-76.95 Electronic computers. Computer science
Online Access:	http://eprints.usm.my/43051/ http://eprints.usm.my/43051/1/Poltak_Sihombing24.pdf

_version_	1848879725623115776
author	Sihombing, Poltak
author_facet	Sihombing, Poltak
author_sort	Sihombing, Poltak
building	USM Institutional Repository
collection	Online Access
description	due to the availability of huge storage spaces, multiple storage devices and different storage media. The rapid growth of data in the database will eventually render the data unmanageable and cause problems in retrieval, where the users are unable to retrieve the right document. This is one of the most important problems in IRS. The use of keywords is one of the methods in IRS which can solve this problem. In this thesis, we propose a methodology in GA (Genetic Algorithm) which is known as Keyword Competition (KC) approach. KC is a competition scheme in finding the best keyword, known as ‘keyword solution’ (KS), among the available keywords. The keyword solution is then matched to the document collection in the database in order to retrieve the most relevant document. In this research, the collection of proceedings of BADAN TENAGA ATOM NASIONAL (BATAN) Indonesia, presented by University of Indonesia (UI), Jakarta was used as a standard dataset. We also propose a keyword based ranking scheme aimed to better rank the retrieved document in the spirit of presenting the most relevant document to the users. Keyword based ranking scheme consists of two (2) main phases; namely keyword solution matching and similarity percentage formulation. In the keyword matching process, the system will match those KS by finding the same words in the title, abstract & keyword of each document collection in the database. The similarity percentage formulation is used to rank the retrieved document based on the similarity value. The scheme was tested with two different fitness formulations, i.e. Jaccard’s function and Cosine’s function. We then compare the result of KC to the similarity level in Hopfield method. A prototype called Journal Browser System (JBS) based on this scheme was developed. The results collected from JBS provide the evidence that KC approach and keyword based ranking scheme give better performance compared to Hopfield method.
first_indexed	2025-11-15T17:51:48Z
format	Thesis
id	usm-43051
institution	Universiti Sains Malaysia
institution_category	Local University
language	English
last_indexed	2025-11-15T17:51:48Z
publishDate	2010
recordtype	eprints
repository_type	Digital Repository
spelling	usm-430512019-04-12T05:26:52Z http://eprints.usm.my/43051/ Keyword Competition Approach In Ranked Document Retrieval Sihombing, Poltak QA75.5-76.95 Electronic computers. Computer science due to the availability of huge storage spaces, multiple storage devices and different storage media. The rapid growth of data in the database will eventually render the data unmanageable and cause problems in retrieval, where the users are unable to retrieve the right document. This is one of the most important problems in IRS. The use of keywords is one of the methods in IRS which can solve this problem. In this thesis, we propose a methodology in GA (Genetic Algorithm) which is known as Keyword Competition (KC) approach. KC is a competition scheme in finding the best keyword, known as ‘keyword solution’ (KS), among the available keywords. The keyword solution is then matched to the document collection in the database in order to retrieve the most relevant document. In this research, the collection of proceedings of BADAN TENAGA ATOM NASIONAL (BATAN) Indonesia, presented by University of Indonesia (UI), Jakarta was used as a standard dataset. We also propose a keyword based ranking scheme aimed to better rank the retrieved document in the spirit of presenting the most relevant document to the users. Keyword based ranking scheme consists of two (2) main phases; namely keyword solution matching and similarity percentage formulation. In the keyword matching process, the system will match those KS by finding the same words in the title, abstract & keyword of each document collection in the database. The similarity percentage formulation is used to rank the retrieved document based on the similarity value. The scheme was tested with two different fitness formulations, i.e. Jaccard’s function and Cosine’s function. We then compare the result of KC to the similarity level in Hopfield method. A prototype called Journal Browser System (JBS) based on this scheme was developed. The results collected from JBS provide the evidence that KC approach and keyword based ranking scheme give better performance compared to Hopfield method. 2010-06 Thesis NonPeerReviewed application/pdf en http://eprints.usm.my/43051/1/Poltak_Sihombing24.pdf Sihombing, Poltak (2010) Keyword Competition Approach In Ranked Document Retrieval. PhD thesis, Universiti Sains Malaysia.
spellingShingle	QA75.5-76.95 Electronic computers. Computer science Sihombing, Poltak Keyword Competition Approach In Ranked Document Retrieval
title	Keyword Competition Approach In Ranked Document Retrieval
title_full	Keyword Competition Approach In Ranked Document Retrieval
title_fullStr	Keyword Competition Approach In Ranked Document Retrieval
title_full_unstemmed	Keyword Competition Approach In Ranked Document Retrieval
title_short	Keyword Competition Approach In Ranked Document Retrieval
title_sort	keyword competition approach in ranked document retrieval
topic	QA75.5-76.95 Electronic computers. Computer science
url	http://eprints.usm.my/43051/ http://eprints.usm.my/43051/1/Poltak_Sihombing24.pdf

Keyword Competition Approach In Ranked Document Retrieval

Similar Items