Curved text detection and ground truth generation for natural scene images / Ch’ng Chee Kheng

Existing scene text datasets lack diversity in text orientation. For instance, curved text is almost absent from them and has therefore received minimal attention from the community. Motivated by this, a new scene text dataset, Total-Text, which emph...


Bibliographic Details
Main Author: Ch’ng, Chee Kheng
Format: Thesis
Published: 2018
Subjects:
Online Access: http://studentsrepo.um.edu.my/10702/
http://studentsrepo.um.edu.my/10702/2/Ch'ng_Chee_Kheng.pdf
http://studentsrepo.um.edu.my/10702/1/Ch%E2%80%99ng_Chee_Kheng_%E2%80%93_Dissertation.pdf
_version_ 1848774207421284352
author Ch’ng, Chee Kheng
building UM Research Repository
collection Online Access
description Existing scene text datasets lack diversity in text orientation. For instance, curved text is almost absent from them and has therefore received minimal attention from the community. Motivated by this, a new scene text dataset, Total-Text, which emphasizes diversity of text orientation, has been collected as the major contribution of this work. It is the first properly scaled scene dataset that features three different text orientations: horizontal, multi-oriented, and curved. In addition, several studies of other important elements, such as the practicality and quality of the groundtruth, the evaluation protocol, insights into curved text, and the annotation process, are presented in this work. These elements are found to be as important as the images and groundtruth in facilitating a new research direction. Furthermore, Polygon-Faster-RCNN, a text detection baseline, is proposed as the second major contribution of this work. It has demonstrated its ability to detect text in all kinds of orientations. Images of Total-Text and its annotations are available at https://github.com/cschan/Total-Text-Dataset.
first_indexed 2025-11-14T13:54:38Z
format Thesis
id um-10702
institution University Malaya
institution_category Local University
last_indexed 2025-11-14T13:54:38Z
publishDate 2018
recordtype eprints
repository_type Digital Repository
spelling um-10702 2020-01-18T02:36:14Z Ch’ng, Chee Kheng (2018) Curved text detection and ground truth generation for natural scene images / Ch’ng Chee Kheng. Masters thesis, University of Malaya. 2018-11 Thesis NonPeerReviewed http://studentsrepo.um.edu.my/10702/
title Curved text detection and ground truth generation for natural scene images / Ch’ng Chee Kheng
topic QA75 Electronic computers. Computer science