Deep learning-based audio-visual speech recognition for Bosnian digits
This study presents a deep learning-based solution for audio-visual speech recognition of Bosnian digits. The task posed a challenge due to the lack of an appropriate Bosnian language dataset, and this study outlines the approach to building a new dataset. The proposed solution includes two comp...
| Main Authors: | Husein Fazlić, Ali Abd Almisreb, Nooritawati Md Tahir |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Penerbit Universiti Kebangsaan Malaysia
2024
|
| Online Access: | http://journalarticle.ukm.my/25132/ http://journalarticle.ukm.my/25132/1/14.pdf |
Similar Items
Development of audio-visual speech recognition using deep-learning technique
by: How, Chun Kit, et al.
Published: (2022)
by: How, Chun Kit, et al.
Published: (2022)
A review of audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2018)
by: Thum, Wei Seong, et al.
Published: (2018)
A novel lip geometry approach for audio-visual speech recognition
by: Mohd Zamri, Ibrahim
Published: (2014)
by: Mohd Zamri, Ibrahim
Published: (2014)
Result comparison of model validation techniques on audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2017)
by: Thum, Wei Seong, et al.
Published: (2017)
Deep word embeddings for visual speech recognition
by: Stafylakis, Themos, et al.
Published: (2018)
by: Stafylakis, Themos, et al.
Published: (2018)
A lip geometry approach for feature-fusion based audio-visual speech recognition
by: M. Z., Ibrahim, et al.
Published: (2014)
by: M. Z., Ibrahim, et al.
Published: (2014)
WADA-W: A modified WADA SNR estimator for audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2019)
by: Thum, Wei Seong, et al.
Published: (2019)
Deep learning for emotional speech recognition
by: Alhamada, M. I., et al.
Published: (2020)
by: Alhamada, M. I., et al.
Published: (2020)
Development on SNR estimator for audio-visual speech recognition based on waveform amplitude distribution analysis
by: Thum, Wei Seong
Published: (2018)
by: Thum, Wei Seong
Published: (2018)
Deep learning for environmentally robust speech recognition
by: Alhamada, A. I., et al.
Published: (2020)
by: Alhamada, A. I., et al.
Published: (2020)
Feature-Fusion based Audio-Visual Speech Recognition using Lip Geometry Features in Noisy Environment
by: M. Z., Ibrahim, et al.
Published: (2015)
by: M. Z., Ibrahim, et al.
Published: (2015)
The Efficacy of Deep Learning-Based Mixed Model for Speech Emotion Recognition
by: Uddin, Mohammad Amaz, et al.
Published: (2022)
by: Uddin, Mohammad Amaz, et al.
Published: (2022)
Bosnian, Croatian, Serbian: Inherent translanguaging in the linguistic landscape of Sarajevo
by: Tankosic, Ana, et al.
Published: (2021)
by: Tankosic, Ana, et al.
Published: (2021)
The Bosnian European membership deadlock - A Brussels’
credibility crisis
by: Brljavac, Bedrudin
Published: (2011)
by: Brljavac, Bedrudin
Published: (2011)
Audio networks for speech enhancement and indexing
by: Kühnapfel, Thorsten
Published: (2009)
by: Kühnapfel, Thorsten
Published: (2009)
Image approach to english digits recognition using deep learning
by: Fatin Nur Amalina, Zainol, et al.
Published: (2022)
by: Fatin Nur Amalina, Zainol, et al.
Published: (2022)
English learning system using speech recognition for visual impaired user (ELSRVI)
by: Rajinder Singh, Harvinder Kaur
Published: (2012)
by: Rajinder Singh, Harvinder Kaur
Published: (2012)
Digital speech watermarking for online speaker recognition systems
by: Nematollahi, Mohammad Ali
Published: (2015)
by: Nematollahi, Mohammad Ali
Published: (2015)
Digital audio and speech watermarking based on the multiple discrete wavelets transform and singular value decomposition
by: Nematollahi, Mohammad Ali, et al.
Published: (2012)
by: Nematollahi, Mohammad Ali, et al.
Published: (2012)
Robust digital speech watermarking for online speaker recognition
by: Nematollahi, Mohammad Ali, et al.
Published: (2015)
by: Nematollahi, Mohammad Ali, et al.
Published: (2015)
Speech emotion recognition using deep feedforward neural network
by: Alghifari, Muhammad Fahreza, et al.
Published: (2018)
by: Alghifari, Muhammad Fahreza, et al.
Published: (2018)
Isolated word speech recognition of the malay digits
by: Mohamad Nasir, Haidawati, et al.
Published: (2014)
by: Mohamad Nasir, Haidawati, et al.
Published: (2014)
Increasing the use of audio visual aids in development work
by: Mohd. Tahir, Mohd. Hanim, et al.
Published: (1983)
by: Mohd. Tahir, Mohd. Hanim, et al.
Published: (1983)
Herb leaves pattern recognition using digital microscope and deep learning
by: Rahman, Muhammad Ariff Azizul, et al.
Published: (2019)
by: Rahman, Muhammad Ariff Azizul, et al.
Published: (2019)
Non-Speech Audio To Sustain Attention In A Multimedia E-Learning Environment
by: Yuen, May Chan
Published: (2007)
by: Yuen, May Chan
Published: (2007)
Faith, Flight and Foreign Policy: Effects of war and migration on Western Australian Bosnian Muslims
by: Vujcich, Daniel
Published: (2016)
by: Vujcich, Daniel
Published: (2016)
Semi-fragile digital speech watermarking for online speaker recognition
by: Nematollahi, Mohammad Ali, et al.
Published: (2015)
by: Nematollahi, Mohammad Ali, et al.
Published: (2015)
Written discourse in digital audio folklore
by: Mansor, Noraien, et al.
Published: (2017)
by: Mansor, Noraien, et al.
Published: (2017)
Performance evaluation of lossless speech and audio compression algorithms
by: Gunawan, Teddy Surya, et al.
Published: (2011)
by: Gunawan, Teddy Surya, et al.
Published: (2011)
Digital speech watermarking for anti-spoofing attack in speaker recognition
by: Nematollahi, Mohammad Ali, et al.
Published: (2014)
by: Nematollahi, Mohammad Ali, et al.
Published: (2014)
English digits speech recognition system based on Hidden Markov Models
by: Abushariah, Ahmad A. M., et al.
Published: (2010)
by: Abushariah, Ahmad A. M., et al.
Published: (2010)
Identification of audio and room parameters for optimum speech intelligibility in room
by: Ng, Tsing Chun
Published: (2007)
by: Ng, Tsing Chun
Published: (2007)
Digital audio watermarking; techniques and applications
by: Olanweraju, Rashidah Funke, et al.
Published: (2012)
by: Olanweraju, Rashidah Funke, et al.
Published: (2012)
Face recognition using deep learning
by: Ooi, Zi Xen
Published: (2019)
by: Ooi, Zi Xen
Published: (2019)
Consonants recognition and noise reduction for Arabic phonemes based Malay speakers / Ali Abd Almisreb
by: Almisreb, Ali Abd
Published: (2016)
by: Almisreb, Ali Abd
Published: (2016)
Consonants recognition and noise reduction for Arabic phonemes based Malay speakers / Ali Abd Almisreb
by: Abd Almisreb, Ali
Published: (2015)
by: Abd Almisreb, Ali
Published: (2015)
English digits speech recognition system based on hidden Markov Models
by: Gunawan, Teddy Surya, et al.
Published: (2011)
by: Gunawan, Teddy Surya, et al.
Published: (2011)
Development of an Isolated Digit Speech Recognition Based on Multilayer Perceptron Model
by: Mohamad Hussin, Ummu Salmah
Published: (2004)
by: Mohamad Hussin, Ummu Salmah
Published: (2004)
Employing Psychoacoustic Model for Digital Audio Watermarking
by: Chai Siew, Li
Published: (2013)
by: Chai Siew, Li
Published: (2013)
Pill Recognition via Deep Learning Approaches
by: Mohd Rais Hakim, Ramlee, et al.
Published: (2024)
by: Mohd Rais Hakim, Ramlee, et al.
Published: (2024)
Similar Items
-
Development of audio-visual speech recognition using deep-learning technique
by: How, Chun Kit, et al.
Published: (2022) -
A review of audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2018) -
A novel lip geometry approach for audio-visual speech recognition
by: Mohd Zamri, Ibrahim
Published: (2014) -
Result comparison of model validation techniques on audio-visual speech recognition
by: Thum, Wei Seong, et al.
Published: (2017) -
Deep word embeddings for visual speech recognition
by: Stafylakis, Themos, et al.
Published: (2018)