A lip geometry approach for feature-fusion based audio-visual speech recognition
This paper describes a feature-fusion audio-visual speech recognition (AVSR) system that extracts lip geometry from the mouth region using a combination of a skin color filter, border following and a convex hull, and performs classification using a Hidden Markov Model. By defining a small number of highly descri...
| Main Authors: | M. Z., Ibrahim, Mulvaney, D. J. |
|---|---|
| Format: | Conference or Workshop Item |
| Language: | English |
| Published: | IEEE 2014 |
| Subjects: | |
| Online Access: | http://umpir.ump.edu.my/id/eprint/29900/ http://umpir.ump.edu.my/id/eprint/29900/1/A%20lip%20geometry%20approach%20for%20feature-fusion%20based%20audio.pdf http://umpir.ump.edu.my/id/eprint/29900/2/A%20lip%20geometry%20approach%20for%20feature-fusion%20based%20audio_FULL.pdf |
Similar Items
Feature-Fusion based Audio-Visual Speech Recognition using Lip Geometry Features in Noisy Environment
by: M. Z., Ibrahim, et al.
Published: (2015)
A novel lip geometry approach for audio-visual speech recognition
by: Mohd Zamri, Ibrahim
Published: (2014)
WADA-W: A modified WADA SNR estimator for audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2019)
A review of audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2018)
Result comparison of model validation techniques on audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2017)
Geometry based lip reading system using Multi Dimension Dynamic Time Warping
by: M. Z., Ibrahim, et al.
Published: (2012)
Development of audio-visual speech recognition using deep-learning technique
by: How, Chun Kit, et al.
Published: (2022)
Development on SNR estimator for audio-visual speech recognition based on waveform amplitude distribution analysis
by: Thum, Wei Seong
Published: (2018)
Geometrical-Based Lip-Reading using Template Probabilistic Multi-Dimension Dynamic Time Warping
by: M. Z., Ibrahim, et al.
Published: (2015)
Image fusion based multi resolution and frequency partition discrete cosine transform for palm vein recognition
by: Soh, Shi Chuan, et al.
Published: (2019)
The Effectiveness of DTW-FF Coefficients and Pitch Feature in NN Speech Recognition
by: Sudirman, Rubita, et al.
Published: (2006)
DTWFF-pitch feature and faster neural network convergence for speech recognition
by: Sudirman, Rubita, et al.
Published: (2007)
Identification of audio and room parameters for optimum speech intelligibility in room
by: Ng, Tsing Chun
Published: (2007)
Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases
by: Ahmad Qadri, Syed Asif, et al.
Published: (2020)
English learning system using speech recognition for visual impaired user (ELSRVI)
by: Rajinder Singh, Harvinder Kaur
Published: (2012)
Command by speech recognition
by: Ardian Syah, Mohd Yusof
Published: (2008)
Deep learning-based audio-visual speech recognition for Bosnian digits
by: Husein Fazlić, et al.
Published: (2024)
Palm vein recognition using scale invariant feature transform with RANSAC mismatching removal
by: Shi Chuan, Soh, et al.
Published: (2017)
Application of Speech Recognition for Swiftlet Vocalizations
by: Siti Nurzalikha Zaini, Husni Zaini, et al.
Published: (2013)
Wavelet cepstral coefficients for isolated speech recognition
by: Adam, Tarmizi, et al.
Published: (2013)
Wavelet cepstral coefficients for isolated speech recognition
by: Adam, Tarmizi, et al.
Published: (2012)
Recurrent neural network with backpropagation through time for speech recognition
by: Ahmad, A. M., et al.
Published: (2004)
Signal segmentation and its application in the feature extraction of speech
by: Abdul Rahman, Ahmad Idil, et al.
Published: (2000)
DC motor speed regulation using speech recognition
by: Firda, Andriyan, et al.
Published: (2021)
Angular features analysis for gait recognition
by: Mohd. Isa, Nur Shahidah, et al.
Published: (2005)
An improved method in speech signal input representation based on DTW technique for NN speech recognition system
by: Sudirman, Rubita, et al.
Published: (2007)
Implementation of Speech Recognition Home Control System Using Arduino
by: Nurul Fadzilah, Hasan, et al.
Published: (2015)
Investigation of lossless audio compression using IEEE 1857.2 advanced audio coding
by: Gunawan, Teddy Surya, et al.
Published: (2017)
3D lips development and measurement for visual speech synthesis
by: Salleh, Siti Salwa, et al.
Published: (2009)
Independent learning of Quran (ILoQ)- alphabet using speech recognition
by: Shafiqah, Sholehuddin
Published: (2011)
Speech processing for makhraj recognition (design adaptive filter for noise removal)
by: Siti Nurmaisarah, Abdul Aziz
Published: (2010)
Speech processing for makhraj recognition: The design of adaptive filter for noise canceller
by: Nurul Wahidah, Arshad, et al.
Published: (2011)
NN speech recognition utilizing aligned DTW local distance scores
by: Sudirman, Rubita, et al.
Published: (2005)
Development of real-time embedded system with speech recognition for smart house
by: Yahya, Zuraimi, et al.
Published: (2007)
Malay continuous speech recognition using continuous density hidden Markov model
by: Ting, Chee Ming
Published: (2007)
NN with DTW-FF Coefficients and Pitch Feature for Speaker Recognition
by: Sudirman, Rubita, et al.
Published: (2006)
Investigation of various algorithms on multichannel audio compression
by: Gunawan, Teddy Surya, et al.
Published: (2017)
Multichannel audio compression: performance evaluation of various algorithms
by: Abdul Rashid, Siti Aisyah, et al.
Published: (2017)
A review of lossless audio compression standards and algorithms
by: Abdul Muin, Fathiah, et al.
Published: (2017)
Local DTW Coefficients and Pitch Feature for Back-Propagation NN Digits Recognition
by: Sudirman, Rubita, et al.
Published: (2006)