Fake news detection: A machine learning approach

The spread of fake news is nothing new in the current day and age, there is a lot of news being spread in Malaysia related to the Covid-19 pandemic, some of which may not be true. Websites like Sebenarnya.my and Malaysiakini can be used to check whether a news headline is true, however this is a man...

Full description

Bibliographic Details
Main Author: Yeoh, Dennis Guan Lee
Format: Final Year Project / Dissertation / Thesis
Published: 2021
Subjects:
Online Access:http://eprints.utar.edu.my/4255/
http://eprints.utar.edu.my/4255/1/17ACB01328_FYP.pdf
_version_ 1848886110770429952
author Yeoh, Dennis Guan Lee
author_facet Yeoh, Dennis Guan Lee
author_sort Yeoh, Dennis Guan Lee
building UTAR Institutional Repository
collection Online Access
description The spread of fake news is nothing new in the current day and age, there is a lot of news being spread in Malaysia related to the Covid-19 pandemic, some of which may not be true. Websites like Sebenarnya.my and Malaysiakini can be used to check whether a news headline is true, however this is a manual and tedious process. Furthermore, there are currently no datasets available that specifically focus on Covid-19 headlines in Malaysia. This project aims to reduce the spread of fake news in Malaysia by developing a web application that can ease and automate the news verification process. The aim of this project was achieved through several objectives. Firstly, a small dataset that is specific to Covid-19 headlines in Malaysia was collected. Next, a competent classification model for determining whether a headline regarding Covid-19 in Malaysia is true, fake, or unsure was trained by using the dataset collected. Finally, a web application was developed to deploy the trained model. The originality of this project lies in the fact that the dataset used to train the model was self-collected. The main contribution of this project on the other hand is the web application that deviates from the usual data verification process which is often done manually. The data collected for the creation of the dataset is obtained in the form of tweets using a Twitter API. These tweets are then labelled as Real, Fake and Unsure according to the sources that posted the tweets. The tweet data then undergoes several pre-processing steps in order to prepare it for model training. Once the dataset was created, several machine learning algorithms were used to train several different models. These models were evaluated in order to pick one to be deployed to the web application. The final model chosen to be deployed was a model trained using a Multinomial Naïve Bayes algorithm.
first_indexed 2025-11-15T19:33:17Z
format Final Year Project / Dissertation / Thesis
id utar-4255
institution Universiti Tunku Abdul Rahman
institution_category Local University
last_indexed 2025-11-15T19:33:17Z
publishDate 2021
recordtype eprints
repository_type Digital Repository
spelling utar-42552022-03-09T13:10:51Z Fake news detection: A machine learning approach Yeoh, Dennis Guan Lee QA75 Electronic computers. Computer science T Technology (General) The spread of fake news is nothing new in the current day and age, there is a lot of news being spread in Malaysia related to the Covid-19 pandemic, some of which may not be true. Websites like Sebenarnya.my and Malaysiakini can be used to check whether a news headline is true, however this is a manual and tedious process. Furthermore, there are currently no datasets available that specifically focus on Covid-19 headlines in Malaysia. This project aims to reduce the spread of fake news in Malaysia by developing a web application that can ease and automate the news verification process. The aim of this project was achieved through several objectives. Firstly, a small dataset that is specific to Covid-19 headlines in Malaysia was collected. Next, a competent classification model for determining whether a headline regarding Covid-19 in Malaysia is true, fake, or unsure was trained by using the dataset collected. Finally, a web application was developed to deploy the trained model. The originality of this project lies in the fact that the dataset used to train the model was self-collected. The main contribution of this project on the other hand is the web application that deviates from the usual data verification process which is often done manually. The data collected for the creation of the dataset is obtained in the form of tweets using a Twitter API. These tweets are then labelled as Real, Fake and Unsure according to the sources that posted the tweets. The tweet data then undergoes several pre-processing steps in order to prepare it for model training. Once the dataset was created, several machine learning algorithms were used to train several different models. These models were evaluated in order to pick one to be deployed to the web application. The final model chosen to be deployed was a model trained using a Multinomial Naïve Bayes algorithm. 2021-04-15 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/4255/1/17ACB01328_FYP.pdf Yeoh, Dennis Guan Lee (2021) Fake news detection: A machine learning approach. Final Year Project, UTAR. http://eprints.utar.edu.my/4255/
spellingShingle QA75 Electronic computers. Computer science
T Technology (General)
Yeoh, Dennis Guan Lee
Fake news detection: A machine learning approach
title Fake news detection: A machine learning approach
title_full Fake news detection: A machine learning approach
title_fullStr Fake news detection: A machine learning approach
title_full_unstemmed Fake news detection: A machine learning approach
title_short Fake news detection: A machine learning approach
title_sort fake news detection: a machine learning approach
topic QA75 Electronic computers. Computer science
T Technology (General)
url http://eprints.utar.edu.my/4255/
http://eprints.utar.edu.my/4255/1/17ACB01328_FYP.pdf