Deep learning-based audio-visual speech recognition for Bosnian digits

This study presents a deep learning-based solution for audio-visual speech recognition of Bosnian digits. The task posed a challenge due to the lack of an appropriate Bosnian language dataset, and this study outlines the approach to building a new dataset. The proposed solution includes two comp...

Full description

Bibliographic Details
Main Authors: Husein Fazlić, Ali Abd Almisreb, Nooritawati Md Tahir
Format: Article
Language:English
Published: Penerbit Universiti Kebangsaan Malaysia 2024
Online Access:http://journalarticle.ukm.my/25132/
http://journalarticle.ukm.my/25132/1/14.pdf