The evaluation of content-oriented XML document retrieval: a case study of FTMSK official letter / Hayati Abdul Rahman

The research applies document segmentation, in which a document is separated into many parts. The term segmentation is usually used where document retrieval is significant. It is important because the content of a document otherwise appears as one big part. Later in the retrieval development, t...


Bibliographic Details
Main Author: Abdul Rahman, Hayati
Format: Research Reports
Language: English
Published: 2006
Subjects: Database management; Information storage and retrieval systems
Online Access: https://ir.uitm.edu.my/id/eprint/2843/
_version_ 1848802662928089088
author Abdul Rahman, Hayati
building UiTM Institutional Repository
collection Online Access
description The research applies document segmentation, in which a document is separated into many parts. The term segmentation is usually used in contexts where document retrieval is significant. It is important because the content of a document otherwise appears as one big part. Later in the retrieval development, the segments are used for indexing. Letter documents have their own format, which consists of many parts. A prototype has been developed to allow segmentation of, and content-based access to, the letter document. The documents are divided into smaller, recognized, labelled parts that are intensive and flexible for managing, editing, and extracting. The aim of this thesis is to apply the official letter standard in the system and to develop an algorithm that segments letter documents and converts them to XML documents. The prototype is implemented in Visual Basic 6.0. Moreover, information retrieval makes the retrieval of documents or collections of data from storage media more efficient, effective, relevant, faster, and more reliable than before. The indexing technique may influence the effectiveness of retrieval itself. The extension component within the indexing structure may also influence the performance of the retrieval process. This research develops a prototype indexing algorithm that takes tag weighting into account for XML documents, and tests the indexer with existing documents. To perform efficient retrieval on documents, an appropriate index structure or algorithm that includes the structural information must be used. The inverted file method is used to develop the indexing algorithm for the FTMSK official letter. Relevant document retrieval using the algorithm was successfully achieved, which indicates that the prototype can increase the relevancy of document retrieval.
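The description above mentions an inverted file index with tag weighting over XML letter documents. As a rough illustration only, the idea might be sketched as follows. The report's prototype was written in Visual Basic 6.0 and its actual algorithm is not given in this record, so the element names (`subject`, `recipient`, `body`), the weight values, and the functions `build_index` and `search` are all hypothetical assumptions.

```python
import re
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical tag weights: element names and values are illustrative
# assumptions, not the scheme used in the report.
TAG_WEIGHTS = {"subject": 3.0, "recipient": 2.0, "body": 1.0}

# Two made-up letters, already segmented into XML parts.
LETTERS = {
    "letter1": "<letter><subject>Exam timetable</subject>"
               "<body>The meeting is on Monday.</body></letter>",
    "letter2": "<letter><subject>Staff meeting</subject>"
               "<body>Please bring the exam timetable.</body></letter>",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(letters):
    """Build an inverted file: term -> {doc_id: accumulated tag weight}."""
    index = defaultdict(lambda: defaultdict(float))
    for doc_id, xml_text in letters.items():
        root = ET.fromstring(xml_text)
        for elem in root.iter():
            weight = TAG_WEIGHTS.get(elem.tag)
            # Skip elements without an assumed weight or without direct text.
            if weight is None or not elem.text:
                continue
            for term in tokenize(elem.text):
                index[term][doc_id] += weight
    return index


def search(index, query):
    """Rank documents by the summed tag weights of matching query terms."""
    scores = defaultdict(float)
    for term in tokenize(query):
        for doc_id, weight in index.get(term, {}).items():
            scores[doc_id] += weight
    # Highest score first; ties broken by document id.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
```

With these assumed weights, a query for "meeting" would rank `letter2` above `letter1`, because the term occurs in the subject of `letter2` but only in the body of `letter1`. Structural information (the tag) therefore changes the ranking, which is the general effect tag weighting is meant to have.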
first_indexed 2025-11-14T21:26:55Z
format Research Reports
id uitm-2843
institution Universiti Teknologi MARA
institution_category Local University
language English
last_indexed 2025-11-14T21:26:55Z
publishDate 2006
recordtype eprints
repository_type Digital Repository
spelling uitm-2843 2021-08-23T06:32:43Z https://ir.uitm.edu.my/id/eprint/2843/ 2006 Research Reports NonPeerReviewed text en https://ir.uitm.edu.my/id/eprint/2843/1/2843.pdf Abdul Rahman, Hayati (2006) The evaluation of content-oriented XML document retrieval: a case study of FTMSK official letter / Hayati Abdul Rahman. (2006) [Research Reports] (Unpublished)
title The evaluation of content-oriented XML document retrieval: a case study of FTMSK official letter / Hayati Abdul Rahman
topic Database management
Information storage and retrieval systems
url https://ir.uitm.edu.my/id/eprint/2843/