A voting-based hybrid machine learning approach for fraudulent financial data classification / Kuldeep Kaur Ragbir Singh

Credit card fraud is a growing concern in the financial industry. While financial losses from credit card fraud amount to billions of dollars each year, investigations on effective predictive models to identify fraud cases using real credit card data are limited currently, mainly due to confidential...

Full description

Bibliographic Details
Main Author: Kuldeep Kaur , Ragbir Singh
Format: Thesis
Published: 2019
Subjects:
Online Access:http://studentsrepo.um.edu.my/10752/
http://studentsrepo.um.edu.my/10752/2/Kuldeep_Kaur.pdf
http://studentsrepo.um.edu.my/10752/1/Kuldeep_Kaur_%E2%80%93_Dissertation.pdf
_version_ 1848774220648022016
author Kuldeep Kaur , Ragbir Singh
author_facet Kuldeep Kaur , Ragbir Singh
author_sort Kuldeep Kaur , Ragbir Singh
building UM Research Repository
collection Online Access
description Credit card fraud is a growing concern in the financial industry. While financial losses from credit card fraud amount to billions of dollars each year, investigations on effective predictive models to identify fraud cases using real credit card data are limited currently, mainly due to confidentiality of customer information. To bridge this gap, this research embarks on developing a hybrid machine learning approach to identify credit card fraud cases based on both benchmark and real-world data. Standard base machine learning algorithms, which include a total of twelve individual methods as well as the AdaBoost and Bagging methods, are firstly used. The voting-based hybrid approach consisting of various machine learning models with the ability to tackle issues related to missing and imbalanced data is then developed. To evaluate the efficacy of the models, publicly available financial and credit card data sets are evaluated. A real credit card data set from a financial institution is also analysed, in order to evaluate the effectiveness of the proposed hybrid approach. In addition to the standard hybrid approach, a sliding window method is further evaluated using the real-world credit card data, with the aim to simulate and assess the capability of real-time identification of fraud cases at the financial institution. The empirical results positively indicate that the hybrid model with the sliding window method is able to yield a good accuracy rate of 82.4% in detecting fraud cases in real world credit card transactions.
first_indexed 2025-11-14T13:54:51Z
format Thesis
id um-10752
institution University Malaya
institution_category Local University
last_indexed 2025-11-14T13:54:51Z
publishDate 2019
recordtype eprints
repository_type Digital Repository
spelling um-107522020-01-20T00:17:23Z A voting-based hybrid machine learning approach for fraudulent financial data classification / Kuldeep Kaur Ragbir Singh Kuldeep Kaur , Ragbir Singh QA75 Electronic computers. Computer science Credit card fraud is a growing concern in the financial industry. While financial losses from credit card fraud amount to billions of dollars each year, investigations on effective predictive models to identify fraud cases using real credit card data are limited currently, mainly due to confidentiality of customer information. To bridge this gap, this research embarks on developing a hybrid machine learning approach to identify credit card fraud cases based on both benchmark and real-world data. Standard base machine learning algorithms, which include a total of twelve individual methods as well as the AdaBoost and Bagging methods, are firstly used. The voting-based hybrid approach consisting of various machine learning models with the ability to tackle issues related to missing and imbalanced data is then developed. To evaluate the efficacy of the models, publicly available financial and credit card data sets are evaluated. A real credit card data set from a financial institution is also analysed, in order to evaluate the effectiveness of the proposed hybrid approach. In addition to the standard hybrid approach, a sliding window method is further evaluated using the real-world credit card data, with the aim to simulate and assess the capability of real-time identification of fraud cases at the financial institution. The empirical results positively indicate that the hybrid model with the sliding window method is able to yield a good accuracy rate of 82.4% in detecting fraud cases in real world credit card transactions. 2019-08 Thesis NonPeerReviewed application/pdf http://studentsrepo.um.edu.my/10752/2/Kuldeep_Kaur.pdf application/pdf http://studentsrepo.um.edu.my/10752/1/Kuldeep_Kaur_%E2%80%93_Dissertation.pdf Kuldeep Kaur , Ragbir Singh (2019) A voting-based hybrid machine learning approach for fraudulent financial data classification / Kuldeep Kaur Ragbir Singh. Masters thesis, University of Malaya. http://studentsrepo.um.edu.my/10752/
spellingShingle QA75 Electronic computers. Computer science
Kuldeep Kaur , Ragbir Singh
A voting-based hybrid machine learning approach for fraudulent financial data classification / Kuldeep Kaur Ragbir Singh
title A voting-based hybrid machine learning approach for fraudulent financial data classification / Kuldeep Kaur Ragbir Singh
title_full A voting-based hybrid machine learning approach for fraudulent financial data classification / Kuldeep Kaur Ragbir Singh
title_fullStr A voting-based hybrid machine learning approach for fraudulent financial data classification / Kuldeep Kaur Ragbir Singh
title_full_unstemmed A voting-based hybrid machine learning approach for fraudulent financial data classification / Kuldeep Kaur Ragbir Singh
title_short A voting-based hybrid machine learning approach for fraudulent financial data classification / Kuldeep Kaur Ragbir Singh
title_sort voting-based hybrid machine learning approach for fraudulent financial data classification / kuldeep kaur ragbir singh
topic QA75 Electronic computers. Computer science
url http://studentsrepo.um.edu.my/10752/
http://studentsrepo.um.edu.my/10752/2/Kuldeep_Kaur.pdf
http://studentsrepo.um.edu.my/10752/1/Kuldeep_Kaur_%E2%80%93_Dissertation.pdf