Development of audio-visual speech recognition using deep-learning technique

Deep learning is an artificial intelligence (AI) technique that simulates human learning behavior. Audio-visual speech recognition is important for helping the listener truly understand the emotions behind the spoken words. In this thesis, two different deep learning models, Convolutional Neural Network...

Bibliographic Details
Main Authors:	How, Chun Kit; Mohd Khairuddin, Ismail; Mohd Razman, Mohd Azraai; Anwar, P. P. Abdul Majeed; Mohd Isa, Wan Hasbullah
Format:	Article
Language:	English
Published:	Penerbit UMP 2022
Subjects:	TJ Mechanical engineering and machinery; TK Electrical engineering. Electronics. Nuclear engineering; TS Manufactures
Online Access:	http://umpir.ump.edu.my/id/eprint/37244/ http://umpir.ump.edu.my/id/eprint/37244/1/Development%20of%20audio%20visual%20speech%20recognition.pdf

Similar Items

A review of audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2018)

Result comparison of model validation techniques on audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2017)

A novel lip geometry approach for audio-visual speech recognition
by: Mohd Zamri, Ibrahim
Published: (2014)

Deep learning-based audio-visual speech recognition for Bosnian digits
by: Husein Fazlić, et al.
Published: (2024)

Badminton smashing recognition through video performance by using deep learning
by: Yip, Zi Ying, et al.
Published: (2022)

A lip geometry approach for feature-fusion based audio-visual speech recognition
by: M. Z., Ibrahim, et al.
Published: (2014)

WADA-W: A modified WADA SNR estimator for audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2019)

Development on SNR estimator for audio-visual speech recognition based on waveform amplitude distribution analysis
by: Thum, Wei Seong
Published: (2018)

Feature-Fusion based Audio-Visual Speech Recognition using Lip Geometry Features in Noisy Environment
by: M. Z., Ibrahim, et al.
Published: (2015)

Deep word embeddings for visual speech recognition
by: Stafylakis, Themos, et al.
Published: (2018)

Identification of audio and room parameters for optimum speech intelligibility in room
by: Ng, Tsing Chun
Published: (2007)

Sign language recognition using deep learning through LSTM and CNN
by: Kiran, Pandian, et al.
Published: (2023)

Deep learning for environmentally robust speech recognition
by: Alhamada, A. I., et al.
Published: (2020)

Human activity recognition based on wrist PPG via the ensemble method
by: Almanifi, Omair Rashed Abdulwareth, et al.
Published: (2022)

Deep learning based human presence detection
by: Venketaramana, Balachandran, et al.
Published: (2020)

Speech emotion recognition using deep feedforward neural network
by: Alghifari, Muhammad Fahreza, et al.
Published: (2018)

Command by speech recognition
by: Ardian Syah, Mohd Yusof
Published: (2008)

The Efficacy of Deep Learning-Based Mixed Model for Speech Emotion Recognition
by: Uddin, Mohammad Amaz, et al.
Published: (2022)

English learning system using speech recognition for visual impaired user (ELSRVI)
by: Rajinder Singh, Harvinder Kaur
Published: (2012)

The classification of heart murmurs: The identification of significant time domain features
by: Cheng, Wai Kit, et al.
Published: (2020)

Articulated robot arm
by: Mohamad Hafiz, Mohd Fauzi, et al.
Published: (2021)

Deep learning for emotional speech recognition
by: Alhamada, M. I., et al.
Published: (2020)

An improved method in speech signal input representation based on DTW technique for NN speech recognition system
by: Sudirman, Rubita, et al.
Published: (2007)

Automated detection of knee cartilage region in X-ray image
by: Teo, Jia Chern, et al.
Published: (2022)

Design a precision motion control of an upper limb robotic arm
by: Kwan, Ze Zhen, et al.
Published: (2022)

The classification of oral squamous cell carcinoma (OSCC) by means of transfer learning
by: Ahmad Ridhauddin, Abdul Rauf, et al.
Published: (2022)

Application of Speech Recognition for Swiftlet Vocalizations
by: Siti Nurzalikha Zaini, Husni Zaini, et al.
Published: (2013)

Analysis of auditory evoked potential signals using wavelet transform and deep learning techniques
by: Islam, Md Nahidul, et al.
Published: (2020)

Correlation of ray tracing technique and ITDG to predict speech intelligibility
by: Mun, Hou Kit
Published: (2007)

Mode choice prediction using machine learning technique for a door-to-door journey in Kuantan City
by: Nur Fahriza, Mohd Ali, et al.
Published: (2020)

The diagnosis of diabetic retinopathy by means of transfer learning and fine-tuned dense layer pipeline
by: Abdo Salman, Abdulaziz, et al.
Published: (2020)

Visual Crowd Counting System Using Deep Learning
by: Mohd Wafi Nazrul Adam, Mohd Ridhwan Oxley Adam
Published: (2021)

Wavelet cepstral coefficients for isolated speech recognition
by: Adam, Tarmizi, et al.
Published: (2013)

Wavelet cepstral coefficients for isolated speech recognition
by: Adam, Tarmizi, et al.
Published: (2012)

Effective Watermarking of Digital Audio and Image using Matlab Technique
by: Subbarayan, Sadasivam, et al.
Published: (2009)

Match outcomes prediction of six top English Premier League clubs via machine learning technique
by: Rabiu Muazu, Musa, et al.
Published: (2018)

Formulation of a deep learning model for automated detection via segmentation of lung cancer
by: Liew, Yee Zhing, et al.
Published: (2024)

Audio networks for speech enhancement and indexing
by: Kühnapfel, Thorsten
Published: (2009)

Image approach to English digits recognition using deep learning
by: Fatin Nur Amalina, Zainol, et al.
Published: (2022)

A mobile application of augmented reality for periodic table with speech recognition
by: Sen, Tan Chee, et al.
Published: (2021)