Optical character recognition for Quranic image similarity matching

The detection and recognition and then conversion of the characters in an image into a text are called optical character recognition (OCR). A distinctive-type of OCR is used to process Arabic characters, namely, Arabic OCR. OCR is increasingly used in many applications, where this process is preferr...

Full description

Bibliographic Details
Main Authors: Alotaibi, Faiz, Abdullah, Muhamad Taufik, Abdullah, Rusli, O. K. Rahmat, Rahmita Wirza, Hashem, Ibrahim Abaker Targio, Sangaiah, Arun Kumar
Format: Article
Language:English
Published: Institute of Electrical and Electronics Engineers 2017
Online Access:http://psasir.upm.edu.my/id/eprint/75148/
http://psasir.upm.edu.my/id/eprint/75148/1/Optical%20character%20recognition%20for%20Quranic%20image%20similarity%20matching.pdf
_version_ 1848857616246112256
author Alotaibi, Faiz
Abdullah, Muhamad Taufik
Abdullah, Rusli
O. K. Rahmat, Rahmita Wirza
Hashem, Ibrahim Abaker Targio
Sangaiah, Arun Kumar
author_facet Alotaibi, Faiz
Abdullah, Muhamad Taufik
Abdullah, Rusli
O. K. Rahmat, Rahmita Wirza
Hashem, Ibrahim Abaker Targio
Sangaiah, Arun Kumar
author_sort Alotaibi, Faiz
building UPM Institutional Repository
collection Online Access
description The detection and recognition and then conversion of the characters in an image into a text are called optical character recognition (OCR). A distinctive-type of OCR is used to process Arabic characters, namely, Arabic OCR. OCR is increasingly used in many applications, where this process is preferred to automatically perform a process without human association. The Quranic text contains two elements, namely, diacritics and characters. However, processing these elements may cause malfunction to the OCR system and reduce its level of accuracy. In this paper, a new method is proposed to check the similarity and originality of Quranic content. This method is based on a combination of Quranic diacritic and character recognition techniques. Diacritic detections are performed using a region-based algorithm. An optimization technique is applied to increase the recognition ratio. Moreover, character recognition is performed based on the projection method. An optimization technique is applied to increase the recognition ratio. The result of the proposed method is compared with the standard Mushaf al Madinah benchmark to find similarities that match with texts of the Holy Quran. The obtained accuracy was superior to the other tested K-nearest neighbor (knn) algorithm and published results in the literature. The accuracies were 96.4286% and 92.3077% better in the improved knn algorithm for diacritics and characters, respectively, than in the knn algorithm.
first_indexed 2025-11-15T12:00:23Z
format Article
id upm-75148
institution Universiti Putra Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T12:00:23Z
publishDate 2017
publisher Institute of Electrical and Electronics Engineers
recordtype eprints
repository_type Digital Repository
spelling upm-751482019-11-26T07:56:19Z http://psasir.upm.edu.my/id/eprint/75148/ Optical character recognition for Quranic image similarity matching Alotaibi, Faiz Abdullah, Muhamad Taufik Abdullah, Rusli O. K. Rahmat, Rahmita Wirza Hashem, Ibrahim Abaker Targio Sangaiah, Arun Kumar The detection and recognition and then conversion of the characters in an image into a text are called optical character recognition (OCR). A distinctive-type of OCR is used to process Arabic characters, namely, Arabic OCR. OCR is increasingly used in many applications, where this process is preferred to automatically perform a process without human association. The Quranic text contains two elements, namely, diacritics and characters. However, processing these elements may cause malfunction to the OCR system and reduce its level of accuracy. In this paper, a new method is proposed to check the similarity and originality of Quranic content. This method is based on a combination of Quranic diacritic and character recognition techniques. Diacritic detections are performed using a region-based algorithm. An optimization technique is applied to increase the recognition ratio. Moreover, character recognition is performed based on the projection method. An optimization technique is applied to increase the recognition ratio. The result of the proposed method is compared with the standard Mushaf al Madinah benchmark to find similarities that match with texts of the Holy Quran. The obtained accuracy was superior to the other tested K-nearest neighbor (knn) algorithm and published results in the literature. The accuracies were 96.4286% and 92.3077% better in the improved knn algorithm for diacritics and characters, respectively, than in the knn algorithm. Institute of Electrical and Electronics Engineers 2017-11-09 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/75148/1/Optical%20character%20recognition%20for%20Quranic%20image%20similarity%20matching.pdf Alotaibi, Faiz and Abdullah, Muhamad Taufik and Abdullah, Rusli and O. K. Rahmat, Rahmita Wirza and Hashem, Ibrahim Abaker Targio and Sangaiah, Arun Kumar (2017) Optical character recognition for Quranic image similarity matching. IEEE Access, 6. 554 - 562. ISSN 2169-3536 https://ieeexplore.ieee.org/document/8101474 10.1109/ACCESS.2017.2771621
spellingShingle Alotaibi, Faiz
Abdullah, Muhamad Taufik
Abdullah, Rusli
O. K. Rahmat, Rahmita Wirza
Hashem, Ibrahim Abaker Targio
Sangaiah, Arun Kumar
Optical character recognition for Quranic image similarity matching
title Optical character recognition for Quranic image similarity matching
title_full Optical character recognition for Quranic image similarity matching
title_fullStr Optical character recognition for Quranic image similarity matching
title_full_unstemmed Optical character recognition for Quranic image similarity matching
title_short Optical character recognition for Quranic image similarity matching
title_sort optical character recognition for quranic image similarity matching
url http://psasir.upm.edu.my/id/eprint/75148/
http://psasir.upm.edu.my/id/eprint/75148/
http://psasir.upm.edu.my/id/eprint/75148/
http://psasir.upm.edu.my/id/eprint/75148/1/Optical%20character%20recognition%20for%20Quranic%20image%20similarity%20matching.pdf