Corner pixel-based method for selecting binary text in scene

Text detection in natural images is a process to indicate the location and presence of text appearing in images. The complexity of the background images, the similarity of text shapes to non-text objects, and the variability in text shapes and colours make automatic text detection in natural images...

Full description

Bibliographic Details
Main Authors:	Ednawati Rainarli, Suprapto, Wahyono
Format:	Article
Language:	English
Published:	Penerbit Universiti Kebangsaan Malaysia 2024
Online Access:	http://journalarticle.ukm.my/25039/ http://journalarticle.ukm.my/25039/1/221%20%E2%80%93%20231.pdf

_version_	1848816251772600320
author	Ednawati Rainarli, Suprapto, Wahyono,
author_facet	Ednawati Rainarli, Suprapto, Wahyono,
author_sort	Ednawati Rainarli,
building	UKM Institutional Repository
collection	Online Access
description	Text detection in natural images is a process to indicate the location and presence of text appearing in images. The complexity of the background images, the similarity of text shapes to non-text objects, and the variability in text shapes and colours make automatic text detection in natural images challenging to achieve using traditional image processing techniques alone. The machine learning methods are one way to perform filtering to eliminate non-text candidates. We used secondary data as additional training data, such as in the ICDAR 2011, ICDAR 2013, and ICDAR 2015. The diversity of text colours in these datasets makes binary image processing not uniformly applicable to each image. Therefore, in this study, we proposed a method to process text images and automatically select between binary or negative binary images by checking pixels at the four corners of the binary and negative binary images. If the number of white pixels is greater than or equal to two, select the negative binary image; otherwise, select the binary image. This way, we automatically selected suitable images for feature extraction before using them to build text and non-text classification models. For low-resolution text images and digitally created text images, in ICDAR 2011, the accuracy of selecting binary text images reached 85.00%. For focused text taken with specific purposes and horizontal text appearances, like in ICDAR 2013, the accuracy of selected binary text images reached up to 92.10%. The accuracy of binary text image selection reached 66.67% for incidental text with multi-oriented text positions. Based on the research results, the proposed strategy can work optimally, especially for focused text with various colours, including white or black coloured text, with diverse sizes and types of text.
first_indexed	2025-11-15T01:02:54Z
format	Article
id	oai:generic.eprints.org:25039
institution	Universiti Kebangasaan Malaysia
institution_category	Local University
language	English
last_indexed	2025-11-15T01:02:54Z
publishDate	2024
publisher	Penerbit Universiti Kebangsaan Malaysia
recordtype	eprints
repository_type	Digital Repository
spelling	oai:generic.eprints.org:250392025-04-08T04:27:37Z http://journalarticle.ukm.my/25039/ Corner pixel-based method for selecting binary text in scene Ednawati Rainarli, Suprapto, Wahyono, Text detection in natural images is a process to indicate the location and presence of text appearing in images. The complexity of the background images, the similarity of text shapes to non-text objects, and the variability in text shapes and colours make automatic text detection in natural images challenging to achieve using traditional image processing techniques alone. The machine learning methods are one way to perform filtering to eliminate non-text candidates. We used secondary data as additional training data, such as in the ICDAR 2011, ICDAR 2013, and ICDAR 2015. The diversity of text colours in these datasets makes binary image processing not uniformly applicable to each image. Therefore, in this study, we proposed a method to process text images and automatically select between binary or negative binary images by checking pixels at the four corners of the binary and negative binary images. If the number of white pixels is greater than or equal to two, select the negative binary image; otherwise, select the binary image. This way, we automatically selected suitable images for feature extraction before using them to build text and non-text classification models. For low-resolution text images and digitally created text images, in ICDAR 2011, the accuracy of selecting binary text images reached 85.00%. For focused text taken with specific purposes and horizontal text appearances, like in ICDAR 2013, the accuracy of selected binary text images reached up to 92.10%. The accuracy of binary text image selection reached 66.67% for incidental text with multi-oriented text positions. Based on the research results, the proposed strategy can work optimally, especially for focused text with various colours, including white or black coloured text, with diverse sizes and types of text. Penerbit Universiti Kebangsaan Malaysia 2024-10-09 Article PeerReviewed application/pdf en http://journalarticle.ukm.my/25039/1/221%20%E2%80%93%20231.pdf Ednawati Rainarli, and Suprapto, and Wahyono, (2024) Corner pixel-based method for selecting binary text in scene. Asia-Pacific Journal of Information Technology and Multimedia, 13 (2). pp. 221-231. ISSN 2289-2192 https://www.ukm.my/apjitm/
spellingShingle	Ednawati Rainarli, Suprapto, Wahyono, Corner pixel-based method for selecting binary text in scene
title	Corner pixel-based method for selecting binary text in scene
title_full	Corner pixel-based method for selecting binary text in scene
title_fullStr	Corner pixel-based method for selecting binary text in scene
title_full_unstemmed	Corner pixel-based method for selecting binary text in scene
title_short	Corner pixel-based method for selecting binary text in scene
title_sort	corner pixel-based method for selecting binary text in scene
url	http://journalarticle.ukm.my/25039/ http://journalarticle.ukm.my/25039/ http://journalarticle.ukm.my/25039/1/221%20%E2%80%93%20231.pdf

Corner pixel-based method for selecting binary text in scene

Similar Items