Credit Card Fraud Detection Using New Preprocessing And Hybrid Machine Learning Techniques

One of the significant problems in the credit card fraud domain is the growing prevalence of imbalanced data. A high ratio of majority to minority classes can lead to misleading results, because conventional machine learning algorithms assume an equal class distribution. The first contribution of this re...


Bibliographic Details
Main Author: Gasim, Esraa Faisal Malik
Format: Thesis
Language: English
Published: 2023
Subjects:
Online Access: http://eprints.usm.my/60174/
http://eprints.usm.my/60174/1/ESRAA%20FAISAL%20MALIK%20GASIM%20-%20TESIS%20cut.pdf
_version_ 1848884371662045184
author Gasim, Esraa Faisal Malik
author_facet Gasim, Esraa Faisal Malik
author_sort Gasim, Esraa Faisal Malik
building USM Institutional Repository
collection Online Access
description One of the significant problems in the credit card fraud domain is the growing prevalence of imbalanced data. A high ratio of majority to minority classes can lead to misleading results, because conventional machine learning algorithms assume an equal class distribution. The first contribution of this research is a new preprocessing technique that combines cost-sensitive learning with data-level resampling to improve classification performance on highly imbalanced datasets. The technique consists of three phases. In the first phase, several data-level resampling techniques, such as SMOTE-ENN, SMOTE-TOMEK, SMOTE-OSS, SMOTE-RUS, and ROS-RUS, each with its default parameters, are compared to find the best-performing technique. In the second phase, cost-sensitive learning is applied with different ratios to determine the best range of ratios for phase three. In the third phase, the resampling percentage is fine-tuned to avoid losing crucial information or producing repetitive synthetic data that could cause overfitting, and the cost-sensitive learning ratio is fine-tuned to set the misclassification costs for the minority class. The new preprocessing technique was found to have a positive impact on the F1-measure and misclassification rate compared with conventional resampling techniques. Furthermore, the negative effect of financial crimes on financial institutions has grown dramatically over the years. The second contribution of this research is the development of multiple hybrid machine learning models to enhance the detection of fraudulent activity in the credit card domain.
first_indexed 2025-11-15T19:05:39Z
format Thesis
id usm-60174
institution Universiti Sains Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T19:05:39Z
publishDate 2023
recordtype eprints
repository_type Digital Repository
spelling usm-60174 2024-03-13T07:17:21Z http://eprints.usm.my/60174/ Credit Card Fraud Detection Using New Preprocessing And Hybrid Machine Learning Techniques Gasim, Esraa Faisal Malik HG4001-4285 Finance management. Business finance. Corporation finance 2023-07 Thesis NonPeerReviewed application/pdf en http://eprints.usm.my/60174/1/ESRAA%20FAISAL%20MALIK%20GASIM%20-%20TESIS%20cut.pdf Gasim, Esraa Faisal Malik (2023) Credit Card Fraud Detection Using New Preprocessing And Hybrid Machine Learning Techniques. PhD thesis, Universiti Sains Malaysia.
spellingShingle HG4001-4285 Finance management. Business finance. Corporation finance
Gasim, Esraa Faisal Malik
Credit Card Fraud Detection Using New Preprocessing And Hybrid Machine Learning Techniques
title Credit Card Fraud Detection Using New Preprocessing And Hybrid Machine Learning Techniques
title_full Credit Card Fraud Detection Using New Preprocessing And Hybrid Machine Learning Techniques
title_fullStr Credit Card Fraud Detection Using New Preprocessing And Hybrid Machine Learning Techniques
title_full_unstemmed Credit Card Fraud Detection Using New Preprocessing And Hybrid Machine Learning Techniques
title_short Credit Card Fraud Detection Using New Preprocessing And Hybrid Machine Learning Techniques
title_sort credit card fraud detection using new preprocessing and hybrid machine learning techniques
topic HG4001-4285 Finance management. Business finance. Corporation finance
url http://eprints.usm.my/60174/
http://eprints.usm.my/60174/1/ESRAA%20FAISAL%20MALIK%20GASIM%20-%20TESIS%20cut.pdf
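
The description in this record names ROS-RUS resampling, a cost-sensitive ratio, and evaluation by F1-measure and misclassification rate. The following is a minimal sketch of those ideas using only the Python standard library. It is not the thesis's implementation: the function names, the `over_factor` and `under_ratio` parameters, and the convention that a false negative costs `cost_ratio` times a false positive are all assumptions made for illustration.

```python
import random
from collections import Counter


def ros_rus(samples, minority=1, over_factor=2.0, under_ratio=1.0, seed=0):
    """Random oversampling (ROS) of the minority class, then random
    undersampling (RUS) of the majority class.

    samples: list of (features, label) pairs.
    over_factor: hypothetical knob; minority count is multiplied by this.
    under_ratio: hypothetical knob; majority/minority ratio after resampling.
    """
    rng = random.Random(seed)
    minority_rows = [s for s in samples if s[1] == minority]
    majority_rows = [s for s in samples if s[1] != minority]

    # ROS: duplicate randomly chosen minority rows (sampling with replacement).
    n_min = max(len(minority_rows), int(round(len(minority_rows) * over_factor)))
    extra = [rng.choice(minority_rows) for _ in range(n_min - len(minority_rows))]

    # RUS: keep a random subset of majority rows (sampling without replacement).
    n_maj = min(len(majority_rows), int(round(n_min * under_ratio)))
    kept = rng.sample(majority_rows, n_maj)

    resampled = minority_rows + extra + kept
    rng.shuffle(resampled)
    return resampled


def f1_and_misclassification(y_true, y_pred, positive=1):
    """Return (F1-measure for the positive class, overall misclassification rate)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    error_rate = sum(1 for t, p in pairs if t != p) / len(pairs)
    return f1, error_rate


def weighted_cost(y_true, y_pred, cost_ratio, positive=1):
    """Total cost when a missed fraud (false negative) costs `cost_ratio`
    and a false alarm (false positive) costs 1. The ratio is an assumption."""
    pairs = list(zip(y_true, y_pred))
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    return cost_ratio * fn + fp


if __name__ == "__main__":
    # Toy data: 95 legitimate transactions (label 0), 5 fraudulent (label 1).
    data = [((i,), 0) for i in range(95)] + [((i,), 1) for i in range(5)]
    balanced = ros_rus(data, over_factor=4.0, under_ratio=2.0, seed=1)
    print(Counter(label for _, label in balanced))
```

Tuning `over_factor` and `under_ratio` corresponds loosely to the fine-tuning of the resampling percentage mentioned in the description, and varying `cost_ratio` corresponds loosely to exploring different cost-sensitive ratios. The SMOTE-based variants named in the record generate synthetic minority samples instead of duplicating existing ones and are not shown here.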