An intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / Bassam Ali Qasem Al-Qatab

Dysarthria is a motor speech impairment at the neurological and/or muscular levels that caused difficulty in pronouncing words clearly. Automatic speech recognition (ASR) system is increasingly applied as assistive technology to aid an individual with physical disability particularly the speech impa...

Full description

Bibliographic Details
Main Author:	Bassam Ali Qasem, Al-Qatab
Format:	Thesis
Published:	2020
Subjects:	QA75 Electronic computers. Computer science
Online Access:	http://studentsrepo.um.edu.my/14484/ http://studentsrepo.um.edu.my/14484/1/Bassam_Ali.pdf http://studentsrepo.um.edu.my/14484/2/Bassam_Ali.pdf

_version_	1848774979764617216
author	Bassam Ali Qasem, Al-Qatab
author_facet	Bassam Ali Qasem, Al-Qatab
author_sort	Bassam Ali Qasem, Al-Qatab
building	UM Research Repository
collection	Online Access
description	Dysarthria is a motor speech impairment at the neurological and/or muscular levels that caused difficulty in pronouncing words clearly. Automatic speech recognition (ASR) system is increasingly applied as assistive technology to aid an individual with physical disability particularly the speech impaired community such as dysarthria speakers. However, the development of an effective ASR system is hindered by the data sparsity, either in the coverage of the language or the size of the existing speech databases. The speaker adaptation (SA) technique is one of the solutions to overcome the data sparsity issue of ASR for dysarthric speakers. Our proposed method introduces the intra-severity classification and adaptation techniques which are applied sequentially in two stages of system development. Firstly, intra-severity classification intended to identify the level of severity of the dysarthric speakers. Secondly, the identified severity level of a particular dysarthric speaker in the first stage is applied to the corresponding intra-severity adaptation of dysarthric speech. For the classification part, there are six algorithms used to classify the intra-severity of dysarthric speakers. The algorithms include Linear Discriminant Analysis (LDA), Artificial Neural Network (ANN), Support Vector Machine (SVM), Naive Bayes (NB), Classification And Regression Tree (CART), Random Forest (RF). The Random Forest (RF) algorithm was proposed as a classifier for the intra-severity classification of the dysarthric speaker which has the lowest average ranking score as compared to other benchmark classifiers. The intra-severity adaptation of the ASR system was developed using two well-known adaptation techniques which are the Maximum Likelihood Linear Regression (MLLR) and Maximum A Posterior (MAP) as well as a combination of them. The results showed that the combination of MLLR+MAP adaptation outperforms all adaptation techniques with total improvement in Word Error Rate (WER) from 39.84% to 18.48% with 53.61% improvement from the baseline WER in the overall performance of the system. The total improvement of the WER based on severity level were 66.32%, 52.35%, and 45.20% for mild, moderate, and severe severity level respectively for the hybrid MLLR+MAP adaptation technique. The combination of the adaptation techniques in sequential order helps to take advantage of each adaptation technique and avoid the flaws of each technique in relation to adaptation data size.
first_indexed	2025-11-14T14:06:54Z
format	Thesis
id	um-14484
institution	University Malaya
institution_category	Local University
last_indexed	2025-11-14T14:06:54Z
publishDate	2020
recordtype	eprints
repository_type	Digital Repository
spelling	um-144842023-06-22T00:05:50Z An intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / Bassam Ali Qasem Al-Qatab Bassam Ali Qasem, Al-Qatab QA75 Electronic computers. Computer science Dysarthria is a motor speech impairment at the neurological and/or muscular levels that caused difficulty in pronouncing words clearly. Automatic speech recognition (ASR) system is increasingly applied as assistive technology to aid an individual with physical disability particularly the speech impaired community such as dysarthria speakers. However, the development of an effective ASR system is hindered by the data sparsity, either in the coverage of the language or the size of the existing speech databases. The speaker adaptation (SA) technique is one of the solutions to overcome the data sparsity issue of ASR for dysarthric speakers. Our proposed method introduces the intra-severity classification and adaptation techniques which are applied sequentially in two stages of system development. Firstly, intra-severity classification intended to identify the level of severity of the dysarthric speakers. Secondly, the identified severity level of a particular dysarthric speaker in the first stage is applied to the corresponding intra-severity adaptation of dysarthric speech. For the classification part, there are six algorithms used to classify the intra-severity of dysarthric speakers. The algorithms include Linear Discriminant Analysis (LDA), Artificial Neural Network (ANN), Support Vector Machine (SVM), Naive Bayes (NB), Classification And Regression Tree (CART), Random Forest (RF). The Random Forest (RF) algorithm was proposed as a classifier for the intra-severity classification of the dysarthric speaker which has the lowest average ranking score as compared to other benchmark classifiers. The intra-severity adaptation of the ASR system was developed using two well-known adaptation techniques which are the Maximum Likelihood Linear Regression (MLLR) and Maximum A Posterior (MAP) as well as a combination of them. The results showed that the combination of MLLR+MAP adaptation outperforms all adaptation techniques with total improvement in Word Error Rate (WER) from 39.84% to 18.48% with 53.61% improvement from the baseline WER in the overall performance of the system. The total improvement of the WER based on severity level were 66.32%, 52.35%, and 45.20% for mild, moderate, and severe severity level respectively for the hybrid MLLR+MAP adaptation technique. The combination of the adaptation techniques in sequential order helps to take advantage of each adaptation technique and avoid the flaws of each technique in relation to adaptation data size. 2020-01 Thesis NonPeerReviewed application/pdf http://studentsrepo.um.edu.my/14484/1/Bassam_Ali.pdf application/pdf http://studentsrepo.um.edu.my/14484/2/Bassam_Ali.pdf Bassam Ali Qasem, Al-Qatab (2020) An intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / Bassam Ali Qasem Al-Qatab. PhD thesis, Universiti Malaya. http://studentsrepo.um.edu.my/14484/
spellingShingle	QA75 Electronic computers. Computer science Bassam Ali Qasem, Al-Qatab An intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / Bassam Ali Qasem Al-Qatab
title	An intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / Bassam Ali Qasem Al-Qatab
title_full	An intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / Bassam Ali Qasem Al-Qatab
title_fullStr	An intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / Bassam Ali Qasem Al-Qatab
title_full_unstemmed	An intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / Bassam Ali Qasem Al-Qatab
title_short	An intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / Bassam Ali Qasem Al-Qatab
title_sort	intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / bassam ali qasem al-qatab
topic	QA75 Electronic computers. Computer science
url	http://studentsrepo.um.edu.my/14484/ http://studentsrepo.um.edu.my/14484/1/Bassam_Ali.pdf http://studentsrepo.um.edu.my/14484/2/Bassam_Ali.pdf

An intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / Bassam Ali Qasem Al-Qatab

Similar Items