Development of audio-visual speech recognition using deep-learning technique
Deep learning is an artificial intelligence (AI) technique that simulates human learning behavior. Audio-visual speech recognition is important for helping the listener truly understand the emotions behind the spoken words. In this thesis, two different deep learning models, Convolutional Neural Network...
| Main Authors: | How, Chun Kit, Mohd Khairuddin, Ismail, Mohd Razman, Mohd Azraai, Anwar, P. P. Abdul Majeed, Mohd Isa, Wan Hasbullah |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Penerbit UMP, 2022 |
| Subjects: | |
| Online Access: | http://umpir.ump.edu.my/id/eprint/37244/ http://umpir.ump.edu.my/id/eprint/37244/1/Development%20of%20audio%20visual%20speech%20recognition.pdf |
Similar Items
A review of audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2018)
Result comparison of model validation techniques on audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2017)
A novel lip geometry approach for audio-visual speech recognition
by: Mohd Zamri, Ibrahim
Published: (2014)
Deep learning-based audio-visual speech recognition for Bosnian digits
by: Husein Fazlić, et al.
Published: (2024)
Badminton smashing recognition through video performance by using deep learning
by: Yip, Zi Ying, et al.
Published: (2022)
A lip geometry approach for feature-fusion based audio-visual speech recognition
by: M. Z., Ibrahim, et al.
Published: (2014)
WADA-W: A modified WADA SNR estimator for audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2019)
Development on SNR estimator for audio-visual speech recognition based on waveform amplitude distribution analysis
by: Thum, Wei Seong
Published: (2018)
Feature-Fusion based Audio-Visual Speech Recognition using Lip Geometry Features in Noisy Environment
by: M. Z., Ibrahim, et al.
Published: (2015)
Deep word embeddings for visual speech recognition
by: Stafylakis, Themos, et al.
Published: (2018)
Identification of audio and room parameters for optimum speech intelligibility in room
by: Ng, Tsing Chun
Published: (2007)
Sign language recognition using deep learning through LSTM and CNN
by: Kiran, Pandian, et al.
Published: (2023)
Deep learning for environmentally robust speech recognition
by: Alhamada, A. I., et al.
Published: (2020)
Human activity recognition based on wrist PPG via the ensemble method
by: Almanifi, Omair Rashed Abdulwareth, et al.
Published: (2022)
Deep learning based human presence detection
by: Venketaramana, Balachandran, et al.
Published: (2020)
Speech emotion recognition using deep feedforward neural network
by: Alghifari, Muhammad Fahreza, et al.
Published: (2018)
Command by speech recognition
by: Ardian Syah, Mohd Yusof
Published: (2008)
The Efficacy of Deep Learning-Based Mixed Model for Speech Emotion Recognition
by: Uddin, Mohammad Amaz, et al.
Published: (2022)
English learning system using speech recognition for visual impaired user (ELSRVI)
by: Rajinder Singh, Harvinder Kaur
Published: (2012)
The classification of heart murmurs: The identification of significant time domain features
by: Cheng, Wai Kit, et al.
Published: (2020)
Articulated robot arm
by: Mohamad Hafiz, Mohd Fauzi, et al.
Published: (2021)
An improved method in speech signal input representation based on DTW technique for NN speech recognition system
by: Sudirman, Rubita, et al.
Published: (2007)
Automated detection of knee cartilage region in X-ray image
by: Teo, Jia Chern, et al.
Published: (2022)
Deep learning for emotional speech recognition
by: Alhamada, M. I., et al.
Published: (2020)
Design a precision motion control of an upper limb robotic arm
by: Kwan, Ze Zhen, et al.
Published: (2022)
The classification of oral squamous cell carcinoma (OSCC) by means of transfer learning
by: Ahmad Ridhauddin, Abdul Rauf, et al.
Published: (2022)
Analysis of auditory evoked potential signals using wavelet transform and deep learning techniques
by: Islam, Md Nahidul, et al.
Published: (2020)
Application of Speech Recognition for Swiftlet Vocalizations
by: Siti Nurzalikha Zaini, Husni Zaini, et al.
Published: (2013)
Correlation of ray tracing technique and ITDG to predict speech intelligibility
by: Mun, Hou Kit
Published: (2007)
Mode choice prediction using machine learning technique for a door-to-door journey in Kuantan City
by: Nur Fahriza, Mohd Ali, et al.
Published: (2020)
The diagnosis of diabetic retinopathy by means of transfer learning and fine-tuned dense layer pipeline
by: Abdo Salman, Abdulaziz, et al.
Published: (2020)
Visual Crowd Counting System Using Deep Learning
by: Mohd Wafi Nazrul Adam, Mohd Ridhwan Oxley Adam
Published: (2021)
Effective Watermarking of Digital Audio and Image using Matlab Technique
by: Subbarayan, Sadasivam, et al.
Published: (2009)
Wavelet cepstral coefficients for isolated speech recognition
by: Adam, Tarmizi, et al.
Published: (2013)
Wavelet cepstral coefficients for isolated speech recognition
by: Adam, Tarmizi, et al.
Published: (2012)
Match outcomes prediction of six top English Premier League clubs via machine learning technique
by: Rabiu Muazu, Musa, et al.
Published: (2018)
Formulation of a deep learning model for automated detection via segmentation of lung cancer
by: Liew, Yee Zhing, et al.
Published: (2024)
Audio networks for speech enhancement and indexing
by: Kühnapfel, Thorsten
Published: (2009)
Image approach to English digits recognition using deep learning
by: Fatin Nur Amalina, Zainol, et al.
Published: (2022)
A mobile application of augmented reality for periodic table with speech recognition
by: Sen, Tan Chee, et al.
Published: (2021)