Hate speech detection in Chinese language using deep learning
recent years, the rise of cyberbullying and online sexism has had devastating consequences, with Chinese social media platforms such as Sina Weibo and Zhihu seeing increased incidents of online harassment, leading to severe outcomes like suicide. To combat this, the project aims to develop deep lear...
| Main Author: | |
|---|---|
| Format: | Final Year Project / Dissertation / Thesis |
| Published: |
2024
|
| Subjects: | |
| Online Access: | http://eprints.utar.edu.my/6955/ http://eprints.utar.edu.my/6955/1/fyp_CS_2024_LHB.pdf |
| _version_ | 1848886811542159360 |
|---|---|
| author | Lim, Hazel Benin |
| author_facet | Lim, Hazel Benin |
| author_sort | Lim, Hazel Benin |
| building | UTAR Institutional Repository |
| collection | Online Access |
| description | recent years, the rise of cyberbullying and online sexism has had devastating consequences, with Chinese social media platforms such as Sina Weibo and Zhihu seeing increased incidents of online harassment, leading to severe outcomes like suicide. To combat this, the project aims to develop deep learning models that effectively classify sexist content in Chinese social media. Despite extensive research on English-language cyberbullying detection, there is limited focus on Chinese contexts, particularly regarding sexism. This study utilizes the Sina Weibo Sexism Review (SWSR) dataset, evaluating several recurrent neural network (RNN) architectures, including RNN, LSTM, GRU, Bi-LSTM, Bi-GRU, RNN-LSTM, and RNN-GRU. These models were tested on balanced and imbalanced datasets, yielding accuracy rates between 74.2% and 76.8%. Precision, recall, and F1 scores ranged from 0.6818 to 0.7447, indicating strong classification performance. Moreover, incorporating emoji embeddings and English-Chinese translation further improved model accuracy and sensitivity in identifying sexist content. This research provides a significant contribution toward addressing online harassment in Chinese text, offering actionable insights for future cyberbullying detection systems. |
| first_indexed | 2025-11-15T19:44:26Z |
| format | Final Year Project / Dissertation / Thesis |
| id | utar-6955 |
| institution | Universiti Tunku Abdul Rahman |
| institution_category | Local University |
| last_indexed | 2025-11-15T19:44:26Z |
| publishDate | 2024 |
| recordtype | eprints |
| repository_type | Digital Repository |
| spelling | utar-69552025-02-27T07:04:41Z Hate speech detection in Chinese language using deep learning Lim, Hazel Benin HX Socialism. Communism. Anarchism T Technology (General) recent years, the rise of cyberbullying and online sexism has had devastating consequences, with Chinese social media platforms such as Sina Weibo and Zhihu seeing increased incidents of online harassment, leading to severe outcomes like suicide. To combat this, the project aims to develop deep learning models that effectively classify sexist content in Chinese social media. Despite extensive research on English-language cyberbullying detection, there is limited focus on Chinese contexts, particularly regarding sexism. This study utilizes the Sina Weibo Sexism Review (SWSR) dataset, evaluating several recurrent neural network (RNN) architectures, including RNN, LSTM, GRU, Bi-LSTM, Bi-GRU, RNN-LSTM, and RNN-GRU. These models were tested on balanced and imbalanced datasets, yielding accuracy rates between 74.2% and 76.8%. Precision, recall, and F1 scores ranged from 0.6818 to 0.7447, indicating strong classification performance. Moreover, incorporating emoji embeddings and English-Chinese translation further improved model accuracy and sensitivity in identifying sexist content. This research provides a significant contribution toward addressing online harassment in Chinese text, offering actionable insights for future cyberbullying detection systems. 2024-06 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/6955/1/fyp_CS_2024_LHB.pdf Lim, Hazel Benin (2024) Hate speech detection in Chinese language using deep learning. Final Year Project, UTAR. http://eprints.utar.edu.my/6955/ |
| spellingShingle | HX Socialism. Communism. Anarchism T Technology (General) Lim, Hazel Benin Hate speech detection in Chinese language using deep learning |
| title | Hate speech detection in Chinese language using deep learning |
| title_full | Hate speech detection in Chinese language using deep learning |
| title_fullStr | Hate speech detection in Chinese language using deep learning |
| title_full_unstemmed | Hate speech detection in Chinese language using deep learning |
| title_short | Hate speech detection in Chinese language using deep learning |
| title_sort | hate speech detection in chinese language using deep learning |
| topic | HX Socialism. Communism. Anarchism T Technology (General) |
| url | http://eprints.utar.edu.my/6955/ http://eprints.utar.edu.my/6955/1/fyp_CS_2024_LHB.pdf |