Analysis Of Failure In Offline English Alphabet Recognition With Data Mining Approach

Offline handwriting recognition is a long existing approach to identify the handwritten phrase, letters or digits. Earlier studies in the handwriting recognition field were mostly focused on recognizing characters using Neural Network Language Model (NNLM) classifier, Hidden Markov Model (HMM)...

Full description

Bibliographic Details
Main Author: Munnian, Ruthrakumar
Format: Monograph
Language:English
Published: Universiti Sains Malaysia 2019
Subjects:
Online Access:http://eprints.usm.my/58271/
http://eprints.usm.my/58271/1/Analysis%20Of%20Failure%20In%20Offline%20English%20Alphabet%20Recognition%20With%20Data%20Mining%20Approach.pdf
_version_ 1848883854361755648
author Munnian, Ruthrakumar
author_facet Munnian, Ruthrakumar
author_sort Munnian, Ruthrakumar
building USM Institutional Repository
collection Online Access
description Offline handwriting recognition is a long existing approach to identify the handwritten phrase, letters or digits. Earlier studies in the handwriting recognition field were mostly focused on recognizing characters using Neural Network Language Model (NNLM) classifier, Hidden Markov Model (HMM), and Support Vector Machine (SVM) with segmentation technique, Hough Transform method, and structural features. However, these approaches involve complex algorithms and require voluminous dataset as the training model. Therefore, this study attempts a data mining approach to the analysis of failure in offline English alphabet recognition. The objectives of the study are to improve the pattern recognition approach for classifying English alphabets and to determine the root of classification failure in handwritten English alphabets. Handwritten data of capital letters of the English alphabet by 50 Universiti Sains Malaysia student experimented. The data was pre-processed to remove the outliers prior to classification analysis with the aid of the Waikato Environment for Knowledge Analysis (WEKA) tool. Classification analysis was initially performed on all seven classifier’s algorithms at 10-fold dross validation mode. At phase one, Stroke and Curve are added into the dataset and classified respectively. At phase two, Sharp Vertex, Closed Region, and Points are added in the dataset. The top three classification algorithms were selected: IBk, LMT and Random Committee for further classification. The classified result was further analyzed to identify the root of classification errors. At the raw dataset classification, the classification accuracy is low with 25%. As the attributes are added to raw dataset respectively, the accuracy of classification was successfully increased to 89%. Conclusively, the accuracy of the classification depends on the added attributes to distinguish characteristics of the alphabets.
first_indexed 2025-11-15T18:57:25Z
format Monograph
id usm-58271
institution Universiti Sains Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T18:57:25Z
publishDate 2019
publisher Universiti Sains Malaysia
recordtype eprints
repository_type Digital Repository
spelling usm-582712023-04-28T08:08:04Z http://eprints.usm.my/58271/ Analysis Of Failure In Offline English Alphabet Recognition With Data Mining Approach Munnian, Ruthrakumar T Technology T351-385 Mechanical drawing. Engineering graphics Offline handwriting recognition is a long existing approach to identify the handwritten phrase, letters or digits. Earlier studies in the handwriting recognition field were mostly focused on recognizing characters using Neural Network Language Model (NNLM) classifier, Hidden Markov Model (HMM), and Support Vector Machine (SVM) with segmentation technique, Hough Transform method, and structural features. However, these approaches involve complex algorithms and require voluminous dataset as the training model. Therefore, this study attempts a data mining approach to the analysis of failure in offline English alphabet recognition. The objectives of the study are to improve the pattern recognition approach for classifying English alphabets and to determine the root of classification failure in handwritten English alphabets. Handwritten data of capital letters of the English alphabet by 50 Universiti Sains Malaysia student experimented. The data was pre-processed to remove the outliers prior to classification analysis with the aid of the Waikato Environment for Knowledge Analysis (WEKA) tool. Classification analysis was initially performed on all seven classifier’s algorithms at 10-fold dross validation mode. At phase one, Stroke and Curve are added into the dataset and classified respectively. At phase two, Sharp Vertex, Closed Region, and Points are added in the dataset. The top three classification algorithms were selected: IBk, LMT and Random Committee for further classification. The classified result was further analyzed to identify the root of classification errors. At the raw dataset classification, the classification accuracy is low with 25%. As the attributes are added to raw dataset respectively, the accuracy of classification was successfully increased to 89%. Conclusively, the accuracy of the classification depends on the added attributes to distinguish characteristics of the alphabets. Universiti Sains Malaysia 2019-06-01 Monograph NonPeerReviewed application/pdf en http://eprints.usm.my/58271/1/Analysis%20Of%20Failure%20In%20Offline%20English%20Alphabet%20Recognition%20With%20Data%20Mining%20Approach.pdf Munnian, Ruthrakumar (2019) Analysis Of Failure In Offline English Alphabet Recognition With Data Mining Approach. Project Report. Universiti Sains Malaysia, Pusat Pengajian Kejuruteraan Mekanik. (Submitted)
spellingShingle T Technology
T351-385 Mechanical drawing. Engineering graphics
Munnian, Ruthrakumar
Analysis Of Failure In Offline English Alphabet Recognition With Data Mining Approach
title Analysis Of Failure In Offline English Alphabet Recognition With Data Mining Approach
title_full Analysis Of Failure In Offline English Alphabet Recognition With Data Mining Approach
title_fullStr Analysis Of Failure In Offline English Alphabet Recognition With Data Mining Approach
title_full_unstemmed Analysis Of Failure In Offline English Alphabet Recognition With Data Mining Approach
title_short Analysis Of Failure In Offline English Alphabet Recognition With Data Mining Approach
title_sort analysis of failure in offline english alphabet recognition with data mining approach
topic T Technology
T351-385 Mechanical drawing. Engineering graphics
url http://eprints.usm.my/58271/
http://eprints.usm.my/58271/1/Analysis%20Of%20Failure%20In%20Offline%20English%20Alphabet%20Recognition%20With%20Data%20Mining%20Approach.pdf