Corner pixel-based method for selecting binary text in scene

Text detection in natural images is a process to indicate the location and presence of text appearing in images. The complexity of the background images, the similarity of text shapes to non-text objects, and the variability in text shapes and colours make automatic text detection in natural images...

Full description

Bibliographic Details
Main Authors: Ednawati Rainarli, Suprapto, Wahyono
Format: Article
Language:English
Published: Penerbit Universiti Kebangsaan Malaysia 2024
Online Access:http://journalarticle.ukm.my/25039/
http://journalarticle.ukm.my/25039/1/221%20%E2%80%93%20231.pdf
_version_ 1848816251772600320
author Ednawati Rainarli,
Suprapto,
Wahyono,
author_facet Ednawati Rainarli,
Suprapto,
Wahyono,
author_sort Ednawati Rainarli,
building UKM Institutional Repository
collection Online Access
description Text detection in natural images is a process to indicate the location and presence of text appearing in images. The complexity of the background images, the similarity of text shapes to non-text objects, and the variability in text shapes and colours make automatic text detection in natural images challenging to achieve using traditional image processing techniques alone. The machine learning methods are one way to perform filtering to eliminate non-text candidates. We used secondary data as additional training data, such as in the ICDAR 2011, ICDAR 2013, and ICDAR 2015. The diversity of text colours in these datasets makes binary image processing not uniformly applicable to each image. Therefore, in this study, we proposed a method to process text images and automatically select between binary or negative binary images by checking pixels at the four corners of the binary and negative binary images. If the number of white pixels is greater than or equal to two, select the negative binary image; otherwise, select the binary image. This way, we automatically selected suitable images for feature extraction before using them to build text and non-text classification models. For low-resolution text images and digitally created text images, in ICDAR 2011, the accuracy of selecting binary text images reached 85.00%. For focused text taken with specific purposes and horizontal text appearances, like in ICDAR 2013, the accuracy of selected binary text images reached up to 92.10%. The accuracy of binary text image selection reached 66.67% for incidental text with multi-oriented text positions. Based on the research results, the proposed strategy can work optimally, especially for focused text with various colours, including white or black coloured text, with diverse sizes and types of text.
first_indexed 2025-11-15T01:02:54Z
format Article
id oai:generic.eprints.org:25039
institution Universiti Kebangasaan Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T01:02:54Z
publishDate 2024
publisher Penerbit Universiti Kebangsaan Malaysia
recordtype eprints
repository_type Digital Repository
spelling oai:generic.eprints.org:250392025-04-08T04:27:37Z http://journalarticle.ukm.my/25039/ Corner pixel-based method for selecting binary text in scene Ednawati Rainarli, Suprapto, Wahyono, Text detection in natural images is a process to indicate the location and presence of text appearing in images. The complexity of the background images, the similarity of text shapes to non-text objects, and the variability in text shapes and colours make automatic text detection in natural images challenging to achieve using traditional image processing techniques alone. The machine learning methods are one way to perform filtering to eliminate non-text candidates. We used secondary data as additional training data, such as in the ICDAR 2011, ICDAR 2013, and ICDAR 2015. The diversity of text colours in these datasets makes binary image processing not uniformly applicable to each image. Therefore, in this study, we proposed a method to process text images and automatically select between binary or negative binary images by checking pixels at the four corners of the binary and negative binary images. If the number of white pixels is greater than or equal to two, select the negative binary image; otherwise, select the binary image. This way, we automatically selected suitable images for feature extraction before using them to build text and non-text classification models. For low-resolution text images and digitally created text images, in ICDAR 2011, the accuracy of selecting binary text images reached 85.00%. For focused text taken with specific purposes and horizontal text appearances, like in ICDAR 2013, the accuracy of selected binary text images reached up to 92.10%. The accuracy of binary text image selection reached 66.67% for incidental text with multi-oriented text positions. Based on the research results, the proposed strategy can work optimally, especially for focused text with various colours, including white or black coloured text, with diverse sizes and types of text. Penerbit Universiti Kebangsaan Malaysia 2024-10-09 Article PeerReviewed application/pdf en http://journalarticle.ukm.my/25039/1/221%20%E2%80%93%20231.pdf Ednawati Rainarli, and Suprapto, and Wahyono, (2024) Corner pixel-based method for selecting binary text in scene. Asia-Pacific Journal of Information Technology and Multimedia, 13 (2). pp. 221-231. ISSN 2289-2192 https://www.ukm.my/apjitm/
spellingShingle Ednawati Rainarli,
Suprapto,
Wahyono,
Corner pixel-based method for selecting binary text in scene
title Corner pixel-based method for selecting binary text in scene
title_full Corner pixel-based method for selecting binary text in scene
title_fullStr Corner pixel-based method for selecting binary text in scene
title_full_unstemmed Corner pixel-based method for selecting binary text in scene
title_short Corner pixel-based method for selecting binary text in scene
title_sort corner pixel-based method for selecting binary text in scene
url http://journalarticle.ukm.my/25039/
http://journalarticle.ukm.my/25039/
http://journalarticle.ukm.my/25039/1/221%20%E2%80%93%20231.pdf