Emulating human cognitive approach for speech emotion recognition using MLP and GenSoFNN

Speech emotion recognition field is growing due to the increasing needs for effective human-computer interaction. There are many approaches in term of features extraction methods coupled with classifiers to obtain optimum performance. However, none can claim superiority as it is very data-depen...

Full description

Bibliographic Details
Main Authors: Kamaruddin, Norhaslinda, Abdul Rahman, Abdul Wahab
Format: Proceeding Paper
Language:English
English
Published: 2013
Subjects:
Online Access:http://irep.iium.edu.my/31010/
http://irep.iium.edu.my/31010/1/Table_of_Content.pdf
http://irep.iium.edu.my/31010/2/106.pdf
_version_ 1848780368527753216
author Kamaruddin, Norhaslinda
Abdul Rahman, Abdul Wahab
author_facet Kamaruddin, Norhaslinda
Abdul Rahman, Abdul Wahab
author_sort Kamaruddin, Norhaslinda
building IIUM Repository
collection Online Access
description Speech emotion recognition field is growing due to the increasing needs for effective human-computer interaction. There are many approaches in term of features extraction methods coupled with classifiers to obtain optimum performance. However, none can claim superiority as it is very data-dependant and domain oriented. In this paper, the appropriate sets of features are investigated using segregation method and feature ranking algorithm of Automatic Relevance Determination (ARD) [1]. Two popular classifiers of Multi Layer Perceptron (MLP) [2] and Generic Self-organizing Fuzzy Neural Network (GenSoFNN) [3] are employed to discriminate emotions in the data corpus used in the FAU Aibo Emotion Corpus [4, 5]. The experimental results shows that Mel Frequency Cepstral Coefficient (MFCC) [6] features are able to yield comparable accuracy with baseline result [5]. In addition, it is observed that MLP can perform slightly better than GenSoFNN. Hence, such system envisages that appropriate combination of features extracted with good classifiers is fundamental for the good speech emotion recognition system.
first_indexed 2025-11-14T15:32:34Z
format Proceeding Paper
id iium-31010
institution International Islamic University Malaysia
institution_category Local University
language English
English
last_indexed 2025-11-14T15:32:34Z
publishDate 2013
recordtype eprints
repository_type Digital Repository
spelling iium-310102020-12-16T17:02:28Z http://irep.iium.edu.my/31010/ Emulating human cognitive approach for speech emotion recognition using MLP and GenSoFNN Kamaruddin, Norhaslinda Abdul Rahman, Abdul Wahab BF511 Affection. Feeling. Emotion H61.8 Communication of information T58.5 Information technology Speech emotion recognition field is growing due to the increasing needs for effective human-computer interaction. There are many approaches in term of features extraction methods coupled with classifiers to obtain optimum performance. However, none can claim superiority as it is very data-dependant and domain oriented. In this paper, the appropriate sets of features are investigated using segregation method and feature ranking algorithm of Automatic Relevance Determination (ARD) [1]. Two popular classifiers of Multi Layer Perceptron (MLP) [2] and Generic Self-organizing Fuzzy Neural Network (GenSoFNN) [3] are employed to discriminate emotions in the data corpus used in the FAU Aibo Emotion Corpus [4, 5]. The experimental results shows that Mel Frequency Cepstral Coefficient (MFCC) [6] features are able to yield comparable accuracy with baseline result [5]. In addition, it is observed that MLP can perform slightly better than GenSoFNN. Hence, such system envisages that appropriate combination of features extracted with good classifiers is fundamental for the good speech emotion recognition system. 2013 Proceeding Paper PeerReviewed application/pdf en http://irep.iium.edu.my/31010/1/Table_of_Content.pdf application/pdf en http://irep.iium.edu.my/31010/2/106.pdf Kamaruddin, Norhaslinda and Abdul Rahman, Abdul Wahab (2013) Emulating human cognitive approach for speech emotion recognition using MLP and GenSoFNN. In: The 4th International Conference on Information & Communication Technology for the Muslim World (ICT4M), 25-27 Mar 2013, Rabat, Moroco. http://ict4m.org/
spellingShingle BF511 Affection. Feeling. Emotion
H61.8 Communication of information
T58.5 Information technology
Kamaruddin, Norhaslinda
Abdul Rahman, Abdul Wahab
Emulating human cognitive approach for speech emotion recognition using MLP and GenSoFNN
title Emulating human cognitive approach for speech emotion recognition using MLP and GenSoFNN
title_full Emulating human cognitive approach for speech emotion recognition using MLP and GenSoFNN
title_fullStr Emulating human cognitive approach for speech emotion recognition using MLP and GenSoFNN
title_full_unstemmed Emulating human cognitive approach for speech emotion recognition using MLP and GenSoFNN
title_short Emulating human cognitive approach for speech emotion recognition using MLP and GenSoFNN
title_sort emulating human cognitive approach for speech emotion recognition using mlp and gensofnn
topic BF511 Affection. Feeling. Emotion
H61.8 Communication of information
T58.5 Information technology
url http://irep.iium.edu.my/31010/
http://irep.iium.edu.my/31010/
http://irep.iium.edu.my/31010/1/Table_of_Content.pdf
http://irep.iium.edu.my/31010/2/106.pdf