Curved text detection and ground truth generation for natural scene images / Ch’ng Chee Kheng

At present, text orientation is not diverse enough in the existing scene text datasets. For instance, text with curve-orientation has close to zero existence in them and thus received minimal attention from the community. Motivated by this phenomenon, a new scene text dataset, Total-Text, which emph...

Full description

Bibliographic Details
Main Author: Ch’ng, Chee Kheng
Format: Thesis
Published: 2018
Subjects:
Online Access:http://studentsrepo.um.edu.my/10702/
http://studentsrepo.um.edu.my/10702/2/Ch'ng_Chee_Kheng.pdf
http://studentsrepo.um.edu.my/10702/1/Ch%E2%80%99ng_Chee_Kheng_%E2%80%93_Dissertation.pdf
Description
Summary:At present, text orientation is not diverse enough in the existing scene text datasets. For instance, text with curve-orientation has close to zero existence in them and thus received minimal attention from the community. Motivated by this phenomenon, a new scene text dataset, Total-Text, which emphasized on text orientations diversity has been collected as the major contribution of this work. It is the first properly scaled scene dataset that features three different text orientations: horizontal, multi-oriented, and curve-oriented. In addition, several studies regarding other important elements such as the practicality and quality of groundtruth, evaluation protocol, insights of curved text, and the annotation process are presented in this work as well. These elements are found to be as important as the images and groundtruth to facilitate a new research direction. In addition, Polygon- Faster-RCNN, a text detection baseline, has been proposed as the second major contribution of this work. It has demonstrated its ability in detecting text in all kinds of orientations. Images of Total-Text and its annotation are available at https://github.com/cschan/ Total-Text-Dataset.