Identifying and categorising profane words in hate speech

This study attempts to explore the different types of Hate Speech appearing in social media by identifying profane words used in hate speech. This study also compares the profane words used in different generations to assist in identifying the user's profile. Five-hundred (500) comments posted...

Full description

Bibliographic Details
Main Authors: Teh, Phoey Lee *, Cheng, Chi-Bin, Chee, Weng Mun
Format: Conference or Workshop Item
Language:English
Published: 2018
Subjects:
Online Access:http://eprints.sunway.edu.my/919/
http://eprints.sunway.edu.my/919/1/Teh%20Phoey%20Lee%20Identifying%20and%20Categorising%20Profane%20Words%20in%20Hate%20Speech%20%28Preprint%29.pdf
_version_ 1848801927709589504
author Teh, Phoey Lee *
Cheng, Chi-Bin
Chee, Weng Mun
author_facet Teh, Phoey Lee *
Cheng, Chi-Bin
Chee, Weng Mun
author_sort Teh, Phoey Lee *
building SU Institutional Repository
collection Online Access
description This study attempts to explore the different types of Hate Speech appearing in social media by identifying profane words used in hate speech. This study also compares the profane words used in different generations to assist in identifying the user's profile. Five-hundred (500) comments posted on YouTube on the abusive topics were collected. Profane words are classified into eight different types of hate speech. The finding shows 35% of profane words found in our sample are words related to sexual orientation. Comparison of the terms between 1970 and 2017 also show a high percentage of profane words are sexual orientation. Though the results are found based on only 500 comments collected from YouTube link in the current study, they are useful in establishing the list of profane words which will serve as the base for automatic hate speech identification in our future study. The originality of this research is the development of a training list of profane words for each category and comparison of the type of the words used in 1970 century with today's social media platform.
first_indexed 2025-11-14T21:15:14Z
format Conference or Workshop Item
id sunway-919
institution Sunway University
institution_category Local University
language English
last_indexed 2025-11-14T21:15:14Z
publishDate 2018
recordtype eprints
repository_type Digital Repository
spelling sunway-9192019-06-11T00:48:46Z http://eprints.sunway.edu.my/919/ Identifying and categorising profane words in hate speech Teh, Phoey Lee * Cheng, Chi-Bin Chee, Weng Mun P Philology. Linguistics This study attempts to explore the different types of Hate Speech appearing in social media by identifying profane words used in hate speech. This study also compares the profane words used in different generations to assist in identifying the user's profile. Five-hundred (500) comments posted on YouTube on the abusive topics were collected. Profane words are classified into eight different types of hate speech. The finding shows 35% of profane words found in our sample are words related to sexual orientation. Comparison of the terms between 1970 and 2017 also show a high percentage of profane words are sexual orientation. Though the results are found based on only 500 comments collected from YouTube link in the current study, they are useful in establishing the list of profane words which will serve as the base for automatic hate speech identification in our future study. The originality of this research is the development of a training list of profane words for each category and comparison of the type of the words used in 1970 century with today's social media platform. 2018-03-23 Conference or Workshop Item PeerReviewed text en cc_by_nc http://eprints.sunway.edu.my/919/1/Teh%20Phoey%20Lee%20Identifying%20and%20Categorising%20Profane%20Words%20in%20Hate%20Speech%20%28Preprint%29.pdf Teh, Phoey Lee * and Cheng, Chi-Bin and Chee, Weng Mun (2018) Identifying and categorising profane words in hate speech. In: 2nd International Conference on Compute and Data Analysis (ICCDA 2018), 23 - 25 March 2018, Northern Illinois University , Dekalb, Chicago. 10.1145/3193077.3193078 doi:10.1145/3193077.3193078
spellingShingle P Philology. Linguistics
Teh, Phoey Lee *
Cheng, Chi-Bin
Chee, Weng Mun
Identifying and categorising profane words in hate speech
title Identifying and categorising profane words in hate speech
title_full Identifying and categorising profane words in hate speech
title_fullStr Identifying and categorising profane words in hate speech
title_full_unstemmed Identifying and categorising profane words in hate speech
title_short Identifying and categorising profane words in hate speech
title_sort identifying and categorising profane words in hate speech
topic P Philology. Linguistics
url http://eprints.sunway.edu.my/919/
http://eprints.sunway.edu.my/919/
http://eprints.sunway.edu.my/919/
http://eprints.sunway.edu.my/919/1/Teh%20Phoey%20Lee%20Identifying%20and%20Categorising%20Profane%20Words%20in%20Hate%20Speech%20%28Preprint%29.pdf