Determinism in speech pitch relation to emotion

Emotional speech synthesis is traditionally achieved using time-pitch manipulation of the synthesized acoustic waveform. Rule-based approaches rely on rules that describe the behavior of the pitch frequency along time to generate time-pitch values. Pitch values fluctuate within a certain range depen...

Full description

Bibliographic Details
Main Authors: Ahmed Mustafa Mahmoud, *, Wan, Haslina Hassan*
Format: Article
Published: Association of Computing Machinery 2009
Subjects:
Online Access:http://eprints.sunway.edu.my/60/
_version_ 1848801734958252032
author Ahmed Mustafa Mahmoud, *
Wan, Haslina Hassan*
author_facet Ahmed Mustafa Mahmoud, *
Wan, Haslina Hassan*
author_sort Ahmed Mustafa Mahmoud, *
building SU Institutional Repository
collection Online Access
description Emotional speech synthesis is traditionally achieved using time-pitch manipulation of the synthesized acoustic waveform. Rule-based approaches rely on rules that describe the behavior of the pitch frequency along time to generate time-pitch values. Pitch values fluctuate within a certain range depending on the intended emotion. Recent studies in emotional cognitive psychology have shown that a slight 4 Hz modification of pitch frequency is sufficient to make significant change in the emotional state of speech. Existing rule-based approaches neglects this determinism by relying on statistical approaches, thus increasing the probability of error. In this paper, a deterministic approach to emotional speech rule-based synthesis algorithm is presented. This approach relies on mapping the pitch frequency values to the 12 semitone melodic scale and extracting semitonic intervals for each emotional state. Using praat analysis tool, emotional speech samples are analyzed and semitonic intervals are extracted. An objective evaluation was used to determine the accuracy of this approach by comparing the simulated speech to natural speech under the intended emotion. Results show that this approach has marked improvements with a low mean square error of no more than 2.65 semitones.
first_indexed 2025-11-14T21:12:10Z
format Article
id sunway-60
institution Sunway University
institution_category Local University
last_indexed 2025-11-14T21:12:10Z
publishDate 2009
publisher Association of Computing Machinery
recordtype eprints
repository_type Digital Repository
spelling sunway-602019-05-14T07:49:01Z http://eprints.sunway.edu.my/60/ Determinism in speech pitch relation to emotion Ahmed Mustafa Mahmoud, * Wan, Haslina Hassan* BF Psychology P Philology. Linguistics Emotional speech synthesis is traditionally achieved using time-pitch manipulation of the synthesized acoustic waveform. Rule-based approaches rely on rules that describe the behavior of the pitch frequency along time to generate time-pitch values. Pitch values fluctuate within a certain range depending on the intended emotion. Recent studies in emotional cognitive psychology have shown that a slight 4 Hz modification of pitch frequency is sufficient to make significant change in the emotional state of speech. Existing rule-based approaches neglects this determinism by relying on statistical approaches, thus increasing the probability of error. In this paper, a deterministic approach to emotional speech rule-based synthesis algorithm is presented. This approach relies on mapping the pitch frequency values to the 12 semitone melodic scale and extracting semitonic intervals for each emotional state. Using praat analysis tool, emotional speech samples are analyzed and semitonic intervals are extracted. An objective evaluation was used to determine the accuracy of this approach by comparing the simulated speech to natural speech under the intended emotion. Results show that this approach has marked improvements with a low mean square error of no more than 2.65 semitones. Association of Computing Machinery 2009 Article PeerReviewed Ahmed Mustafa Mahmoud, * and Wan, Haslina Hassan* (2009) Determinism in speech pitch relation to emotion. Determinism in speech pitch relation to emotion (403). pp. 32-37. http://dx.doi.org/10.1145/1655925.1655931
spellingShingle BF Psychology
P Philology. Linguistics
Ahmed Mustafa Mahmoud, *
Wan, Haslina Hassan*
Determinism in speech pitch relation to emotion
title Determinism in speech pitch relation to emotion
title_full Determinism in speech pitch relation to emotion
title_fullStr Determinism in speech pitch relation to emotion
title_full_unstemmed Determinism in speech pitch relation to emotion
title_short Determinism in speech pitch relation to emotion
title_sort determinism in speech pitch relation to emotion
topic BF Psychology
P Philology. Linguistics
url http://eprints.sunway.edu.my/60/
http://eprints.sunway.edu.my/60/