Web application for lipreading

Communication happens every day in our daily lives. However, there are conditions where the communication occurs in an environment which impedes the listener from listening to the message clearly. Therefore, this project aims to develop a web application that can perform lipreading using an existing...

Full description

Bibliographic Details
Main Author: Lau, Yee Lin
Format: Final Year Project / Dissertation / Thesis
Published: 2024
Subjects:
Online Access:http://eprints.utar.edu.my/6818/
http://eprints.utar.edu.my/6818/1/2005403_LAU_YEE_LIN.pdf
_version_ 1848886775761600512
author Lau, Yee Lin
author_facet Lau, Yee Lin
author_sort Lau, Yee Lin
building UTAR Institutional Repository
collection Online Access
description Communication happens every day in our daily lives. However, there are conditions where the communication occurs in an environment which impedes the listener from listening to the message clearly. Therefore, this project aims to develop a web application that can perform lipreading using an existing deep learning model, LipCoordNet. It allows users to upload video to the web application and the application will generate text and video output for the users to visualize the speech instead of listening to the sounds. The users can choose to download the predicted text to their own device for future usage. Based on the output of the lipreading, the average word error rate (WER) and character error rate (CER) of an Asian speaker and a Native speaker is calculated, resulting in the average WER and CER value of the Asian speaker being higher than that of the Native speaker. To reduce the WER and CER of the sentences spoken by Asian speakers, efforts have been made in trying to train the LipCoordNet model with the Asian speakers dataset. 270 Asian speaker dataset has been collected with 27 Asian speakers speaking 10 sentences each. For the evaluation of the usability of the web application, five respondents are selected to participate in the system usability testing and contribute to the system usability scale (SUS) score. The SUS score obtained is 87.5, indicating that the system receives a grade A with the adjective rating of Excellent.
first_indexed 2025-11-15T19:43:51Z
format Final Year Project / Dissertation / Thesis
id utar-6818
institution Universiti Tunku Abdul Rahman
institution_category Local University
last_indexed 2025-11-15T19:43:51Z
publishDate 2024
recordtype eprints
repository_type Digital Repository
spelling utar-68182024-11-21T05:17:06Z Web application for lipreading Lau, Yee Lin QA76 Computer software T Technology (General) Communication happens every day in our daily lives. However, there are conditions where the communication occurs in an environment which impedes the listener from listening to the message clearly. Therefore, this project aims to develop a web application that can perform lipreading using an existing deep learning model, LipCoordNet. It allows users to upload video to the web application and the application will generate text and video output for the users to visualize the speech instead of listening to the sounds. The users can choose to download the predicted text to their own device for future usage. Based on the output of the lipreading, the average word error rate (WER) and character error rate (CER) of an Asian speaker and a Native speaker is calculated, resulting in the average WER and CER value of the Asian speaker being higher than that of the Native speaker. To reduce the WER and CER of the sentences spoken by Asian speakers, efforts have been made in trying to train the LipCoordNet model with the Asian speakers dataset. 270 Asian speaker dataset has been collected with 27 Asian speakers speaking 10 sentences each. For the evaluation of the usability of the web application, five respondents are selected to participate in the system usability testing and contribute to the system usability scale (SUS) score. The SUS score obtained is 87.5, indicating that the system receives a grade A with the adjective rating of Excellent. 2024 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/6818/1/2005403_LAU_YEE_LIN.pdf Lau, Yee Lin (2024) Web application for lipreading. Final Year Project, UTAR. http://eprints.utar.edu.my/6818/
spellingShingle QA76 Computer software
T Technology (General)
Lau, Yee Lin
Web application for lipreading
title Web application for lipreading
title_full Web application for lipreading
title_fullStr Web application for lipreading
title_full_unstemmed Web application for lipreading
title_short Web application for lipreading
title_sort web application for lipreading
topic QA76 Computer software
T Technology (General)
url http://eprints.utar.edu.my/6818/
http://eprints.utar.edu.my/6818/1/2005403_LAU_YEE_LIN.pdf