Result comparison of model validation techniques on audio-visual speech recognition
This paper implements and compares the performance of a number of techniques proposed for improving the accuracy of Automatic Speech Recognition (ASR) systems. Because the input to speech-only ASR can be contaminated by environmental noise, in some applications it may improve performance to employ Audio-Vis...
| Main Authors: | Thum, Wei Seong; M. Z., Ibrahim; Nurul Wahidah, Arshad; D.J., Mulvaney |
|---|---|
| Format: | Book Chapter |
| Language: | English |
| Published: | Springer, Singapore 2017 |
| Subjects: | |
| Online Access: | http://umpir.ump.edu.my/id/eprint/20566/ http://umpir.ump.edu.my/id/eprint/20566/13/78.%20Result%20Comparison%20of%20Model%20Validation%20Techniques%20on%20Audio-Visual%20Speech%20Recognition.pdf http://umpir.ump.edu.my/id/eprint/20566/14/78.%20A%20Comparison%20of%20Model%20Validation%20Techniques%20on%20Audio-Visual%20Speech%20Recognition.pdf |
Similar Items
A review of audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2018)
WADA-W: A modified WADA SNR estimator for audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2019)
Development on SNR estimator for audio-visual speech recognition based on waveform amplitude distribution analysis
by: Thum, Wei Seong
Published: (2018)
A lip geometry approach for feature-fusion based audio-visual speech recognition
by: M. Z., Ibrahim, et al.
Published: (2014)
Feature-Fusion based Audio-Visual Speech Recognition using Lip Geometry Features in Noisy Environment
by: M. Z., Ibrahim, et al.
Published: (2015)
A novel lip geometry approach for audio-visual speech recognition
by: Mohd Zamri, Ibrahim
Published: (2014)
Development of audio-visual speech recognition using deep-learning technique
by: How, Chun Kit, et al.
Published: (2022)
Palm vein recognition using scale invariant feature transform with RANSAC mismatching removal
by: Shi Chuan, Soh, et al.
Published: (2017)
Comparison between fuzzy and NN method for speech emotion recognition
by: Razak, AA, et al.
Published: (2005)
Deep learning-based audio-visual speech recognition for Bosnian digits
by: Husein Fazlić, et al.
Published: (2024)
Arabic automatic continuous speech recognition systems
by: Abushariah, Mohammad A. M., et al.
Published: (2011)
LPC and its derivatives for stuttered speech recognition
by: Alim, Sabur Ajibola, et al.
Published: (2015)
Identification of audio and room parameters for optimum speech intelligibility in room
by: Ng, Tsing Chun
Published: (2007)
Human posture recognition database and preprocessing simulation results
by: Htike, Kyaw Kyaw, et al.
Published: (2011)
Human posture recognition results using database A
by: Htike, Kyaw Kyaw, et al.
Published: (2011)
Heterogeneous driver behavior state recognition using speech signal
by: Kamaruddin, Norhaslinda, et al.
Published: (2011)
Speech features analysis of the joint speech separation and automatic speech recognition model / Tawseef Khan
by: Tawseef, Khan
Published: (2021)
English digits speech recognition system based on Hidden Markov Models
by: Abushariah, Ahmad A. M., et al.
Published: (2010)
English learning system using speech recognition for visual impaired user (ELSRVI)
by: Rajinder Singh, Harvinder Kaur
Published: (2012)
Speech Recognition and Speech Synthesis Repository System / Ng Koon Lee
by: Ng, Koon Lee
Published: (2002)
English digits speech recognition system based on hidden Markov Models
by: Gunawan, Teddy Surya, et al.
Published: (2011)
Speech processing for makhraj recognition: The design of adaptive filter for noise canceller
by: Nurul Wahidah, Arshad, et al.
Published: (2011)
Command by speech recognition
by: Ardian Syah, Mohd Yusof
Published: (2008)
Bimodal person identification system based on speech and signature recognition's
by: Gunawan, Teddy Surya
Published: (2013)
Emotion speech recognition using KDE and MLP neural networks
by: Abdul Rahman, Abdul Wahab, et al.
Published: (2011)
Speech recognition for Malay Language to command & control computer program / Mohd Junaidi Jusoh
by: Jusoh, Mohd Junaidi
Published: (2007)
Embedded Character Recognition System using Random Forest Algorithm for IC Inspection System
by: Chong, Wei Jian, et al.
Published: (2017)
Isolated Malay speech recognition using fuzzy logic
by: Normiza, Mohd Yusof
Published: (2019)
Analyzing driving behaviour using speech recognition through KDE and MLP
by: Abdul Rahman, Abdul Wahab, et al.
Published: (2011)
Developing a model of speech recognition process from an autopoietic approach
by: Liew, Eng Siang, et al.
Published: (2000)
Speaker’s variabilities, technology and language issues that affect automatic speech and speaker recognition systems
by: Abushariah, Mohammad A. M., et al.
Published: (2011)
Speech recognition to determine emotions using quick propagation neural network / Aisyah Mohamad Tojid
by: Mohamad Tojid, Aisyah
Published: (2006)
Named-entity recognition for numerical expression in Malay text-to-speech systems / Lit Wei Wern
by: Lit, Wei Wern
Published: (2019)
Isolated word speech recognition of the malay digits
by: Mohamad Nasir, Haidawati, et al.
Published: (2014)
Genetic based substitution techniques for audio steganography
by: Zamani, Mazdak
Published: (2010)
Application of Speech Recognition for Swiftlet Vocalizations
by: Siti Nurzalikha Zaini, Husni Zaini, et al.
Published: (2013)
A genetic-algorithm-based approach for audio steganography
by: Zamani, Mazdak, et al.
Published: (2009)
A Novel Approach for Audio Watermarking
by: Zamani, Mazdak, et al.
Published: (2009)
Wavelet cepstral coefficients for isolated speech recognition
by: Adam, Tarmizi, et al.
Published: (2013)
Wavelet cepstral coefficients for isolated speech recognition
by: Adam, Tarmizi, et al.
Published: (2012)