Impact of acoustical voice activity detection on spontaneous filled pause classification
Filled pause detection is imperative for spontaneous speech recognition, as filled pauses may degrade the speech recognition rate. However, filled pauses are commonly confused with elongations, as they share the same acoustical properties. Few attempts at classifying filled pauses and elongations have employed the Hidden Markov...
| Main Authors: | Hamzah, Raseeda; Jamil, Nursuriati; Seman, Noraini; Ardi, Norizah; C. Doraisamy, Shyamala |
|---|---|
| Format: | Conference or Workshop Item |
| Language: | English |
| Published: | IEEE, 2014 |
| Online Access: | http://psasir.upm.edu.my/id/eprint/56314/ http://psasir.upm.edu.my/id/eprint/56314/1/Impact%20of%20acoustical%20voice%20activity%20detection%20on%20spontaneous%20filled%20pause%20classification.pdf |
| _version_ | 1848853048303026176 |
|---|---|
| author | Hamzah, Raseeda; Jamil, Nursuriati; Seman, Noraini; Ardi, Norizah; C. Doraisamy, Shyamala |
| author_sort | Hamzah, Raseeda |
| building | UPM Institutional Repository |
| collection | Online Access |
| description | Filled pause detection is imperative for spontaneous speech recognition, as filled pauses may degrade the speech recognition rate. However, filled pauses are commonly confused with elongations, as they share the same acoustical properties. Few attempts at classifying filled pauses and elongations have employed the Hidden Markov model. Our proposed method, which uses a neural network as the classifier, achieved a 96% precision rate. We also proved that voice activity detection (VAD) affects the performance of speech recognition. Three acoustical-based VAD methods are compared, and the best precision rate is achieved by incorporating volume and first-order difference features. Experiments are conducted on Malay-language spontaneous speech from Malaysian Parliamentary Debate sessions. |
| first_indexed | 2025-11-15T10:47:46Z |
| format | Conference or Workshop Item |
| id | upm-56314 |
| institution | Universiti Putra Malaysia |
| institution_category | Local University |
| language | English |
| last_indexed | 2025-11-15T10:47:46Z |
| publishDate | 2014 |
| publisher | IEEE |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | upm-56314; 2017-07-31T05:22:11Z; PeerReviewed; application/pdf; en. Hamzah, Raseeda and Jamil, Nursuriati and Seman, Noraini and Ardi, Norizah and C. Doraisamy, Shyamala (2014) Impact of acoustical voice activity detection on spontaneous filled pause classification. In: 2014 IEEE Conference on Open Systems (ICOS), 26-28 Oct. 2014, Subang Jaya, Selangor, Malaysia. (pp. 1-6). doi:10.1109/ICOS.2014.7042400 |
| title | Impact of acoustical voice activity detection on spontaneous filled pause classification |