Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases

In the speech signal, emotion is considered one of the most critical elements. For the recognition of emotions, the field of speech emotion recognition came into ex-istence. Speech Emotion Recognition (SER) is becoming an area of research in-terest in the last few years. A typical SER system focuses...

Full description

Bibliographic Details
Main Authors: Ahmad Qadri, Syed Asif, Gunawan, Teddy Surya, Kartiwi, Mira, Mansor, Hasmah
Format: Book Chapter
Language:English
English
Published: Springer 2020
Subjects:
Online Access:http://irep.iium.edu.my/82534/
http://irep.iium.edu.my/82534/1/Paper_182.pdf
http://irep.iium.edu.my/82534/2/Acceptance%20Letter_DrTeddy_IIUM.pdf
_version_ 1848789314318630912
author Ahmad Qadri, Syed Asif
Gunawan, Teddy Surya
Kartiwi, Mira
Mansor, Hasmah
author_facet Ahmad Qadri, Syed Asif
Gunawan, Teddy Surya
Kartiwi, Mira
Mansor, Hasmah
author_sort Ahmad Qadri, Syed Asif
building IIUM Repository
collection Online Access
description In the speech signal, emotion is considered one of the most critical elements. For the recognition of emotions, the field of speech emotion recognition came into ex-istence. Speech Emotion Recognition (SER) is becoming an area of research in-terest in the last few years. A typical SER system focuses on extracting features such as pitch frequency, formant features, energy-related features, and spectral features from speech, tailing it with a classification quest to foresee different clas-ses of emotion. The critical issue to be addressed for a successful SER system is the emotional feature extraction, which can be solved by using different feature extraction techniques. In this paper, along with Teager Energy Operator (TEO) and Mel Frequency Cepstral Coefficients (MFCC) a trailblazing feature extrac-tion method, a fusion of MFCC and TEO as Teager-MFCC (T-MFCC) is used for the recognition of energy-based emotions. We have used three corpora of emotions in German, English, and Hindi to develop the multilingual SER system. The classification of these energy-based emotions is done by Deep Neural Net-work (DNN). It is found that TEO achieves a better recognition rate compared to MFCC and T-MFCC.
first_indexed 2025-11-14T17:54:45Z
format Book Chapter
id iium-82534
institution International Islamic University Malaysia
institution_category Local University
language English
English
last_indexed 2025-11-14T17:54:45Z
publishDate 2020
publisher Springer
recordtype eprints
repository_type Digital Repository
spelling iium-825342020-09-18T02:58:06Z http://irep.iium.edu.my/82534/ Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases Ahmad Qadri, Syed Asif Gunawan, Teddy Surya Kartiwi, Mira Mansor, Hasmah TK7885 Computer engineering In the speech signal, emotion is considered one of the most critical elements. For the recognition of emotions, the field of speech emotion recognition came into ex-istence. Speech Emotion Recognition (SER) is becoming an area of research in-terest in the last few years. A typical SER system focuses on extracting features such as pitch frequency, formant features, energy-related features, and spectral features from speech, tailing it with a classification quest to foresee different clas-ses of emotion. The critical issue to be addressed for a successful SER system is the emotional feature extraction, which can be solved by using different feature extraction techniques. In this paper, along with Teager Energy Operator (TEO) and Mel Frequency Cepstral Coefficients (MFCC) a trailblazing feature extrac-tion method, a fusion of MFCC and TEO as Teager-MFCC (T-MFCC) is used for the recognition of energy-based emotions. We have used three corpora of emotions in German, English, and Hindi to develop the multilingual SER system. The classification of these energy-based emotions is done by Deep Neural Net-work (DNN). It is found that TEO achieves a better recognition rate compared to MFCC and T-MFCC. Springer 2020-07 Book Chapter NonPeerReviewed application/pdf en http://irep.iium.edu.my/82534/1/Paper_182.pdf application/pdf en http://irep.iium.edu.my/82534/2/Acceptance%20Letter_DrTeddy_IIUM.pdf Ahmad Qadri, Syed Asif and Gunawan, Teddy Surya and Kartiwi, Mira and Mansor, Hasmah (2020) Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases. In: Springer's Lecture Nores in Electrical Engineering (LNEE). Springer, pp. 1-10. (In Press)
spellingShingle TK7885 Computer engineering
Ahmad Qadri, Syed Asif
Gunawan, Teddy Surya
Kartiwi, Mira
Mansor, Hasmah
Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases
title Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases
title_full Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases
title_fullStr Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases
title_full_unstemmed Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases
title_short Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases
title_sort speech emotion recognition using feature fusion of teo and mfcc on multilingual databases
topic TK7885 Computer engineering
url http://irep.iium.edu.my/82534/
http://irep.iium.edu.my/82534/1/Paper_182.pdf
http://irep.iium.edu.my/82534/2/Acceptance%20Letter_DrTeddy_IIUM.pdf