Development of an Automated Technique for Reconstructing Jawi Characters in Historical Documents

The old documents in Jawi script are still being used widely for references. The quality of the hard copies of those scripts will be deteriorating as time passes. Manual reconstruction may take long time if the documents are sufficiently thick. The accuracy of the document image recognition algorith...

Full description

Bibliographic Details
Main Author: Zulcaffle, Tengku Mohd Afendi
Format: Thesis
Language:English
English
Published: 2007
Subjects:
Online Access:http://psasir.upm.edu.my/id/eprint/5561/
http://psasir.upm.edu.my/id/eprint/5561/1/ITMA_2007_2.pdf
_version_ 1848840133437030400
author Zulcaffle, Tengku Mohd Afendi
author_facet Zulcaffle, Tengku Mohd Afendi
author_sort Zulcaffle, Tengku Mohd Afendi
building UPM Institutional Repository
collection Online Access
description The old documents in Jawi script are still being used widely for references. The quality of the hard copies of those scripts will be deteriorating as time passes. Manual reconstruction may take long time if the documents are sufficiently thick. The accuracy of the document image recognition algorithms is much dependent on the level of noise on the document. Therefore, the development of the historical Jawi character reconstruction algorithm is a significant contributions to the success of the old Jawi manuscript maintenance and recognition systems. The Background Subtraction technique has proved to be the best algorithm when historical document images were evaluated. The proposed technique has improved the algorithm by incorporating an autonomous decision making, that makes the binarization technique a scale invariant algorithm. The prefiltering and post processing will further enhance the ability of the algorithm to remove noise from the documents. In the post binarization algorithm, separation techniques between characters with holes and without holes is introduced in order for different morphological operations to be applied to those characters. This method will enhance connection between broken characters but still preserving the originality of the document. A noise model has been developed to test the reliability of the proposed algorithm. The model was developed based on several predefined criteria. The algorithms have been implemented using Matlab software version 6.5. The reliability of the proposed algorithms have been tested over simulated and real data. Comparison has been made between the Background Subtraction technique and the proposed method by manual inspection and mathematical evaluation. The results of the algorithms were mathematically evaluated using the Relative Foreground Area Error. Results have shown that better performance has been obtained using the proposed method. The framework managed to create historical Jawi characters more presentable. The system is not only applicable to historical Jawi characters, it can be easily adapted to any other historical characters in different languages.
first_indexed 2025-11-15T07:22:30Z
format Thesis
id upm-5561
institution Universiti Putra Malaysia
institution_category Local University
language English
English
last_indexed 2025-11-15T07:22:30Z
publishDate 2007
recordtype eprints
repository_type Digital Repository
spelling upm-55612013-05-27T07:23:43Z http://psasir.upm.edu.my/id/eprint/5561/ Development of an Automated Technique for Reconstructing Jawi Characters in Historical Documents Zulcaffle, Tengku Mohd Afendi The old documents in Jawi script are still being used widely for references. The quality of the hard copies of those scripts will be deteriorating as time passes. Manual reconstruction may take long time if the documents are sufficiently thick. The accuracy of the document image recognition algorithms is much dependent on the level of noise on the document. Therefore, the development of the historical Jawi character reconstruction algorithm is a significant contributions to the success of the old Jawi manuscript maintenance and recognition systems. The Background Subtraction technique has proved to be the best algorithm when historical document images were evaluated. The proposed technique has improved the algorithm by incorporating an autonomous decision making, that makes the binarization technique a scale invariant algorithm. The prefiltering and post processing will further enhance the ability of the algorithm to remove noise from the documents. In the post binarization algorithm, separation techniques between characters with holes and without holes is introduced in order for different morphological operations to be applied to those characters. This method will enhance connection between broken characters but still preserving the originality of the document. A noise model has been developed to test the reliability of the proposed algorithm. The model was developed based on several predefined criteria. The algorithms have been implemented using Matlab software version 6.5. The reliability of the proposed algorithms have been tested over simulated and real data. Comparison has been made between the Background Subtraction technique and the proposed method by manual inspection and mathematical evaluation. The results of the algorithms were mathematically evaluated using the Relative Foreground Area Error. Results have shown that better performance has been obtained using the proposed method. The framework managed to create historical Jawi characters more presentable. The system is not only applicable to historical Jawi characters, it can be easily adapted to any other historical characters in different languages. 2007 Thesis NonPeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/5561/1/ITMA_2007_2.pdf Zulcaffle, Tengku Mohd Afendi (2007) Development of an Automated Technique for Reconstructing Jawi Characters in Historical Documents. Masters thesis, Universiti Putra Malaysia. Jawi alphabet. Information storage and retrieval systems. English
spellingShingle Jawi alphabet.
Information storage and retrieval systems.
Zulcaffle, Tengku Mohd Afendi
Development of an Automated Technique for Reconstructing Jawi Characters in Historical Documents
title Development of an Automated Technique for Reconstructing Jawi Characters in Historical Documents
title_full Development of an Automated Technique for Reconstructing Jawi Characters in Historical Documents
title_fullStr Development of an Automated Technique for Reconstructing Jawi Characters in Historical Documents
title_full_unstemmed Development of an Automated Technique for Reconstructing Jawi Characters in Historical Documents
title_short Development of an Automated Technique for Reconstructing Jawi Characters in Historical Documents
title_sort development of an automated technique for reconstructing jawi characters in historical documents
topic Jawi alphabet.
Information storage and retrieval systems.
url http://psasir.upm.edu.my/id/eprint/5561/
http://psasir.upm.edu.my/id/eprint/5561/1/ITMA_2007_2.pdf