Developing a fine-tuned transformer model to detect social media hate speech texts

This research explores the development and evaluation of a hate speech detection system using transformer-based models, focusing on the robustness, efficiency, and scalability of the model. The study emphasizes key design considerations, including scalability, which addresses the model's capabi...

Full description

Bibliographic Details
Main Author: Chung, Master Ek-Karat
Format: Final Year Project / Dissertation / Thesis
Published: 2024
Subjects:
Online Access:http://eprints.utar.edu.my/6896/
http://eprints.utar.edu.my/6896/1/fyp_IA_2024_CMEK.pdf
_version_ 1848886794379067392
author Chung, Master Ek-Karat
author_facet Chung, Master Ek-Karat
author_sort Chung, Master Ek-Karat
building UTAR Institutional Repository
collection Online Access
description This research explores the development and evaluation of a hate speech detection system using transformer-based models, focusing on the robustness, efficiency, and scalability of the model. The study emphasizes key design considerations, including scalability, which addresses the model's capability to handle large volumes of data, and accuracy, achieved through fine-tuning methods for transformer models like BERT. Reviewing model to do proper performance analysis on existing model in detecting Social Media Hate Speech Texts such as Long Short-Term Memory (LSTM) and Bidirectional Gated Recurrent Unit (Bi-GRU), GigaBERT for Arabic Hate Speech Detection, BERT-Based Approaches, DistilBERT and RoBERTa, T5 and Electra, Comparison of Transformer Models and its Challenges and Limitation. Besides, it also briefly discusses on system design to ensure that the model is conceptually accurate, scalable, and maintainable, providing a flexible framework for ongoing research in hate speech detection on social media. The research also discusses on the facing challenges such as data imbalance, computational limitations, and extensive hyperparameter tuning, all of which were addressed through various techniques and strategies. This research show system's experiment/ simulation to show performance with evaluated using a Logistic Regression model on a split dataset, fine-tuning with GridSearchCV, how the model's accuracy improved. The experiment successfully show a predictive model with high accuracy and precision, also indicated future improvements in detecting hate speech on social media. The results underscore the importance of ongoing refinement in machine learning models or deep learning model to address complex, real-world issues such as hate speech detection.
first_indexed 2025-11-15T19:44:09Z
format Final Year Project / Dissertation / Thesis
id utar-6896
institution Universiti Tunku Abdul Rahman
institution_category Local University
last_indexed 2025-11-15T19:44:09Z
publishDate 2024
recordtype eprints
repository_type Digital Repository
spelling utar-68962025-02-14T07:37:09Z Developing a fine-tuned transformer model to detect social media hate speech texts Chung, Master Ek-Karat T Technology (General) This research explores the development and evaluation of a hate speech detection system using transformer-based models, focusing on the robustness, efficiency, and scalability of the model. The study emphasizes key design considerations, including scalability, which addresses the model's capability to handle large volumes of data, and accuracy, achieved through fine-tuning methods for transformer models like BERT. Reviewing model to do proper performance analysis on existing model in detecting Social Media Hate Speech Texts such as Long Short-Term Memory (LSTM) and Bidirectional Gated Recurrent Unit (Bi-GRU), GigaBERT for Arabic Hate Speech Detection, BERT-Based Approaches, DistilBERT and RoBERTa, T5 and Electra, Comparison of Transformer Models and its Challenges and Limitation. Besides, it also briefly discusses on system design to ensure that the model is conceptually accurate, scalable, and maintainable, providing a flexible framework for ongoing research in hate speech detection on social media. The research also discusses on the facing challenges such as data imbalance, computational limitations, and extensive hyperparameter tuning, all of which were addressed through various techniques and strategies. This research show system's experiment/ simulation to show performance with evaluated using a Logistic Regression model on a split dataset, fine-tuning with GridSearchCV, how the model's accuracy improved. The experiment successfully show a predictive model with high accuracy and precision, also indicated future improvements in detecting hate speech on social media. The results underscore the importance of ongoing refinement in machine learning models or deep learning model to address complex, real-world issues such as hate speech detection. 2024-05 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/6896/1/fyp_IA_2024_CMEK.pdf Chung, Master Ek-Karat (2024) Developing a fine-tuned transformer model to detect social media hate speech texts. Final Year Project, UTAR. http://eprints.utar.edu.my/6896/
spellingShingle T Technology (General)
Chung, Master Ek-Karat
Developing a fine-tuned transformer model to detect social media hate speech texts
title Developing a fine-tuned transformer model to detect social media hate speech texts
title_full Developing a fine-tuned transformer model to detect social media hate speech texts
title_fullStr Developing a fine-tuned transformer model to detect social media hate speech texts
title_full_unstemmed Developing a fine-tuned transformer model to detect social media hate speech texts
title_short Developing a fine-tuned transformer model to detect social media hate speech texts
title_sort developing a fine-tuned transformer model to detect social media hate speech texts
topic T Technology (General)
url http://eprints.utar.edu.my/6896/
http://eprints.utar.edu.my/6896/1/fyp_IA_2024_CMEK.pdf