Developing a fine-tuned transformer model to detect social media hate speech texts

This research explores the development and evaluation of a hate speech detection system using transformer-based models, focusing on the robustness, efficiency, and scalability of the model. The study emphasizes key design considerations, including scalability, which addresses the model's capabi...

Full description

Bibliographic Details
Main Author:	Chung, Master Ek-Karat
Format:	Final Year Project / Dissertation / Thesis
Published:	2024
Subjects:	T Technology (General)
Online Access:	http://eprints.utar.edu.my/6896/ http://eprints.utar.edu.my/6896/1/fyp_IA_2024_CMEK.pdf

_version_	1848886794379067392
author	Chung, Master Ek-Karat
author_facet	Chung, Master Ek-Karat
author_sort	Chung, Master Ek-Karat
building	UTAR Institutional Repository
collection	Online Access
description	This research explores the development and evaluation of a hate speech detection system using transformer-based models, focusing on the robustness, efficiency, and scalability of the model. The study emphasizes key design considerations, including scalability, which addresses the model's capability to handle large volumes of data, and accuracy, achieved through fine-tuning methods for transformer models like BERT. Reviewing model to do proper performance analysis on existing model in detecting Social Media Hate Speech Texts such as Long Short-Term Memory (LSTM) and Bidirectional Gated Recurrent Unit (Bi-GRU), GigaBERT for Arabic Hate Speech Detection, BERT-Based Approaches, DistilBERT and RoBERTa, T5 and Electra, Comparison of Transformer Models and its Challenges and Limitation. Besides, it also briefly discusses on system design to ensure that the model is conceptually accurate, scalable, and maintainable, providing a flexible framework for ongoing research in hate speech detection on social media. The research also discusses on the facing challenges such as data imbalance, computational limitations, and extensive hyperparameter tuning, all of which were addressed through various techniques and strategies. This research show system's experiment/ simulation to show performance with evaluated using a Logistic Regression model on a split dataset, fine-tuning with GridSearchCV, how the model's accuracy improved. The experiment successfully show a predictive model with high accuracy and precision, also indicated future improvements in detecting hate speech on social media. The results underscore the importance of ongoing refinement in machine learning models or deep learning model to address complex, real-world issues such as hate speech detection.
first_indexed	2025-11-15T19:44:09Z
format	Final Year Project / Dissertation / Thesis
id	utar-6896
institution	Universiti Tunku Abdul Rahman
institution_category	Local University
last_indexed	2025-11-15T19:44:09Z
publishDate	2024
recordtype	eprints
repository_type	Digital Repository
spelling	utar-68962025-02-14T07:37:09Z Developing a fine-tuned transformer model to detect social media hate speech texts Chung, Master Ek-Karat T Technology (General) This research explores the development and evaluation of a hate speech detection system using transformer-based models, focusing on the robustness, efficiency, and scalability of the model. The study emphasizes key design considerations, including scalability, which addresses the model's capability to handle large volumes of data, and accuracy, achieved through fine-tuning methods for transformer models like BERT. Reviewing model to do proper performance analysis on existing model in detecting Social Media Hate Speech Texts such as Long Short-Term Memory (LSTM) and Bidirectional Gated Recurrent Unit (Bi-GRU), GigaBERT for Arabic Hate Speech Detection, BERT-Based Approaches, DistilBERT and RoBERTa, T5 and Electra, Comparison of Transformer Models and its Challenges and Limitation. Besides, it also briefly discusses on system design to ensure that the model is conceptually accurate, scalable, and maintainable, providing a flexible framework for ongoing research in hate speech detection on social media. The research also discusses on the facing challenges such as data imbalance, computational limitations, and extensive hyperparameter tuning, all of which were addressed through various techniques and strategies. This research show system's experiment/ simulation to show performance with evaluated using a Logistic Regression model on a split dataset, fine-tuning with GridSearchCV, how the model's accuracy improved. The experiment successfully show a predictive model with high accuracy and precision, also indicated future improvements in detecting hate speech on social media. The results underscore the importance of ongoing refinement in machine learning models or deep learning model to address complex, real-world issues such as hate speech detection. 2024-05 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/6896/1/fyp_IA_2024_CMEK.pdf Chung, Master Ek-Karat (2024) Developing a fine-tuned transformer model to detect social media hate speech texts. Final Year Project, UTAR. http://eprints.utar.edu.my/6896/
spellingShingle	T Technology (General) Chung, Master Ek-Karat Developing a fine-tuned transformer model to detect social media hate speech texts
title	Developing a fine-tuned transformer model to detect social media hate speech texts
title_full	Developing a fine-tuned transformer model to detect social media hate speech texts
title_fullStr	Developing a fine-tuned transformer model to detect social media hate speech texts
title_full_unstemmed	Developing a fine-tuned transformer model to detect social media hate speech texts
title_short	Developing a fine-tuned transformer model to detect social media hate speech texts
title_sort	developing a fine-tuned transformer model to detect social media hate speech texts
topic	T Technology (General)
url	http://eprints.utar.edu.my/6896/ http://eprints.utar.edu.my/6896/1/fyp_IA_2024_CMEK.pdf

Developing a fine-tuned transformer model to detect social media hate speech texts

Similar Items