Hate speech detection in Chinese language using deep learning

recent years, the rise of cyberbullying and online sexism has had devastating consequences, with Chinese social media platforms such as Sina Weibo and Zhihu seeing increased incidents of online harassment, leading to severe outcomes like suicide. To combat this, the project aims to develop deep lear...

Full description

Bibliographic Details
Main Author: Lim, Hazel Benin
Format: Final Year Project / Dissertation / Thesis
Published: 2024
Subjects:
Online Access:http://eprints.utar.edu.my/6955/
http://eprints.utar.edu.my/6955/1/fyp_CS_2024_LHB.pdf
_version_ 1848886811542159360
author Lim, Hazel Benin
author_facet Lim, Hazel Benin
author_sort Lim, Hazel Benin
building UTAR Institutional Repository
collection Online Access
description recent years, the rise of cyberbullying and online sexism has had devastating consequences, with Chinese social media platforms such as Sina Weibo and Zhihu seeing increased incidents of online harassment, leading to severe outcomes like suicide. To combat this, the project aims to develop deep learning models that effectively classify sexist content in Chinese social media. Despite extensive research on English-language cyberbullying detection, there is limited focus on Chinese contexts, particularly regarding sexism. This study utilizes the Sina Weibo Sexism Review (SWSR) dataset, evaluating several recurrent neural network (RNN) architectures, including RNN, LSTM, GRU, Bi-LSTM, Bi-GRU, RNN-LSTM, and RNN-GRU. These models were tested on balanced and imbalanced datasets, yielding accuracy rates between 74.2% and 76.8%. Precision, recall, and F1 scores ranged from 0.6818 to 0.7447, indicating strong classification performance. Moreover, incorporating emoji embeddings and English-Chinese translation further improved model accuracy and sensitivity in identifying sexist content. This research provides a significant contribution toward addressing online harassment in Chinese text, offering actionable insights for future cyberbullying detection systems.
first_indexed 2025-11-15T19:44:26Z
format Final Year Project / Dissertation / Thesis
id utar-6955
institution Universiti Tunku Abdul Rahman
institution_category Local University
last_indexed 2025-11-15T19:44:26Z
publishDate 2024
recordtype eprints
repository_type Digital Repository
spelling utar-69552025-02-27T07:04:41Z Hate speech detection in Chinese language using deep learning Lim, Hazel Benin HX Socialism. Communism. Anarchism T Technology (General) recent years, the rise of cyberbullying and online sexism has had devastating consequences, with Chinese social media platforms such as Sina Weibo and Zhihu seeing increased incidents of online harassment, leading to severe outcomes like suicide. To combat this, the project aims to develop deep learning models that effectively classify sexist content in Chinese social media. Despite extensive research on English-language cyberbullying detection, there is limited focus on Chinese contexts, particularly regarding sexism. This study utilizes the Sina Weibo Sexism Review (SWSR) dataset, evaluating several recurrent neural network (RNN) architectures, including RNN, LSTM, GRU, Bi-LSTM, Bi-GRU, RNN-LSTM, and RNN-GRU. These models were tested on balanced and imbalanced datasets, yielding accuracy rates between 74.2% and 76.8%. Precision, recall, and F1 scores ranged from 0.6818 to 0.7447, indicating strong classification performance. Moreover, incorporating emoji embeddings and English-Chinese translation further improved model accuracy and sensitivity in identifying sexist content. This research provides a significant contribution toward addressing online harassment in Chinese text, offering actionable insights for future cyberbullying detection systems. 2024-06 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/6955/1/fyp_CS_2024_LHB.pdf Lim, Hazel Benin (2024) Hate speech detection in Chinese language using deep learning. Final Year Project, UTAR. http://eprints.utar.edu.my/6955/
spellingShingle HX Socialism. Communism. Anarchism
T Technology (General)
Lim, Hazel Benin
Hate speech detection in Chinese language using deep learning
title Hate speech detection in Chinese language using deep learning
title_full Hate speech detection in Chinese language using deep learning
title_fullStr Hate speech detection in Chinese language using deep learning
title_full_unstemmed Hate speech detection in Chinese language using deep learning
title_short Hate speech detection in Chinese language using deep learning
title_sort hate speech detection in chinese language using deep learning
topic HX Socialism. Communism. Anarchism
T Technology (General)
url http://eprints.utar.edu.my/6955/
http://eprints.utar.edu.my/6955/1/fyp_CS_2024_LHB.pdf