An intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / Bassam Ali Qasem Al-Qatab

Dysarthria is a motor speech impairment at the neurological and/or muscular level that causes difficulty in pronouncing words clearly. Automatic speech recognition (ASR) systems are increasingly applied as assistive technology to aid individuals with physical disabilities, particularly the speech impa...

Bibliographic Details
Main Author: Bassam Ali Qasem, Al-Qatab
Format: Thesis
Published: 2020
Subjects: QA75 Electronic computers. Computer science
Online Access: http://studentsrepo.um.edu.my/14484/
http://studentsrepo.um.edu.my/14484/1/Bassam_Ali.pdf
http://studentsrepo.um.edu.my/14484/2/Bassam_Ali.pdf
_version_ 1848774979764617216
author Bassam Ali Qasem, Al-Qatab
author_facet Bassam Ali Qasem, Al-Qatab
author_sort Bassam Ali Qasem, Al-Qatab
building UM Research Repository
collection Online Access
description Dysarthria is a motor speech impairment at the neurological and/or muscular level that causes difficulty in pronouncing words clearly. Automatic speech recognition (ASR) systems are increasingly applied as assistive technology to aid individuals with physical disabilities, particularly the speech-impaired community such as dysarthric speakers. However, the development of an effective ASR system is hindered by data sparsity, either in the coverage of the language or in the size of the existing speech databases. Speaker adaptation (SA) is one of the solutions to the data sparsity issue in ASR for dysarthric speakers. Our proposed method introduces intra-severity classification and adaptation techniques, which are applied sequentially in two stages of system development. First, intra-severity classification identifies the severity level of the dysarthric speaker. Second, the severity level identified in the first stage is used to select the corresponding intra-severity adaptation of dysarthric speech. For the classification stage, six algorithms were used to classify the intra-severity of dysarthric speakers: Linear Discriminant Analysis (LDA), Artificial Neural Network (ANN), Support Vector Machine (SVM), Naive Bayes (NB), Classification And Regression Tree (CART), and Random Forest (RF). The RF algorithm was proposed as the classifier for intra-severity classification of dysarthric speakers, as it has the lowest average ranking score compared with the other benchmark classifiers. The intra-severity adaptation of the ASR system was developed using two well-known adaptation techniques, Maximum Likelihood Linear Regression (MLLR) and Maximum A Posteriori (MAP), as well as a combination of the two.
The results showed that the combined MLLR+MAP adaptation outperforms all other adaptation techniques, reducing the overall Word Error Rate (WER) from 39.84% to 18.48%, a 53.61% relative improvement over the baseline WER. By severity level, the WER improvement with the hybrid MLLR+MAP adaptation technique was 66.32%, 52.35%, and 45.20% for mild, moderate, and severe speakers, respectively. Combining the adaptation techniques in sequential order takes advantage of each technique and avoids the flaws of each in relation to the adaptation data size.
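The 53.61% figure in the description is consistent with a relative reduction computed from the two reported overall WER values (39.84% and 18.48%). A minimal Python sketch of that arithmetic follows. The function name `relative_wer_reduction` is hypothetical and not taken from the thesis, and the thesis may have derived its figures differently; only the two WER values come from the record.

```python
def relative_wer_reduction(baseline_wer: float, adapted_wer: float) -> float:
    """Return the WER reduction as a percentage of the baseline WER."""
    if baseline_wer <= 0:
        raise ValueError("baseline WER must be positive")
    return 100.0 * (baseline_wer - adapted_wer) / baseline_wer


# Overall WER values quoted in the record's description (in percent).
baseline = 39.84
adapted = 18.48

# Absolute reduction is a difference in percentage points;
# relative reduction is that difference divided by the baseline.
absolute = baseline - adapted
relative = relative_wer_reduction(baseline, adapted)

print(f"absolute reduction: {absolute:.2f} percentage points")
print(f"relative reduction: {relative:.2f}%")
```

The absolute drop is about 21.36 percentage points, while the relative drop is about 53.61% of the baseline, which is why the two ways of stating the improvement should not be confused.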
first_indexed 2025-11-14T14:06:54Z
format Thesis
id um-14484
institution University Malaya
institution_category Local University
last_indexed 2025-11-14T14:06:54Z
publishDate 2020
recordtype eprints
repository_type Digital Repository
spelling um-14484 2023-06-22T00:05:50Z An intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / Bassam Ali Qasem Al-Qatab Bassam Ali Qasem, Al-Qatab QA75 Electronic computers. Computer science 2020-01 Thesis NonPeerReviewed application/pdf http://studentsrepo.um.edu.my/14484/1/Bassam_Ali.pdf application/pdf http://studentsrepo.um.edu.my/14484/2/Bassam_Ali.pdf Bassam Ali Qasem, Al-Qatab (2020) An intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / Bassam Ali Qasem Al-Qatab. PhD thesis, Universiti Malaya. http://studentsrepo.um.edu.my/14484/
spellingShingle QA75 Electronic computers. Computer science
Bassam Ali Qasem, Al-Qatab
An intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / Bassam Ali Qasem Al-Qatab
title An intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / Bassam Ali Qasem Al-Qatab
title_sort intra-severity classification and adaptation technique to improve dysarthric speech recognition accuracy / bassam ali qasem al-qatab
topic QA75 Electronic computers. Computer science
url http://studentsrepo.um.edu.my/14484/
http://studentsrepo.um.edu.my/14484/1/Bassam_Ali.pdf
http://studentsrepo.um.edu.my/14484/2/Bassam_Ali.pdf