Effective detection of purchasing intention for online shopping

The main issue with the below expectations in detecting purchasing intention is caused by the unbalanced data set and its overlapping class problem. To identify a sampling method that best improves the detection rate, this project performed four categories of sampling experiments, resulting in 2,011...

Full description

Bibliographic Details
Main Author: Kang, Shu Yi
Format: Final Year Project / Dissertation / Thesis
Published: 2023
Subjects:
Online Access:http://eprints.utar.edu.my/5884/
http://eprints.utar.edu.my/5884/1/SE_1903794_FYP_report_%2D_KangShuYi_%2D_SHU_YI_KANG.pdf
_version_ 1848886528398327808
author Kang, Shu Yi
author_facet Kang, Shu Yi
author_sort Kang, Shu Yi
building UTAR Institutional Repository
collection Online Access
description The main issue with the below expectations in detecting purchasing intention is caused by the unbalanced data set and its overlapping class problem. To identify a sampling method that best improves the detection rate, this project performed four categories of sampling experiments, resulting in 2,011 experiments in total. To improve the detection results, a hybrid of undersampling and oversampling was applied to reduce and increase the size of the majority and minority classes of the unbalanced data set used in this project, respectively. Undersampling rates from 10% to 80%, and oversampling rates from 10% to 90% are used in combinations to achieve effective detections for the class "Buy", which is the minority in the data set. Random undersampling and five variants of Synthetic Minority Oversampling Techniques (SMOTE): Standard SMOTE, ADASYN, ANS, Borderline SMOTE, and SVM SMOTE, were utilised on the data set. Then, the resulting data sets were crossvalidated and tested with five classifiers: Decision Tree, Logistic Regression, Naïve Bayes, Random Forest and SVM. The result indicated that applying Random Forest with the random undersampling rate of 80% and oversampling rate (ANS) of 80% yielded the best recall in detecting the majority and minority classes overall.
first_indexed 2025-11-15T19:39:56Z
format Final Year Project / Dissertation / Thesis
id utar-5884
institution Universiti Tunku Abdul Rahman
institution_category Local University
last_indexed 2025-11-15T19:39:56Z
publishDate 2023
recordtype eprints
repository_type Digital Repository
spelling utar-58842023-10-05T12:08:11Z Effective detection of purchasing intention for online shopping Kang, Shu Yi QA76 Computer software The main issue with the below expectations in detecting purchasing intention is caused by the unbalanced data set and its overlapping class problem. To identify a sampling method that best improves the detection rate, this project performed four categories of sampling experiments, resulting in 2,011 experiments in total. To improve the detection results, a hybrid of undersampling and oversampling was applied to reduce and increase the size of the majority and minority classes of the unbalanced data set used in this project, respectively. Undersampling rates from 10% to 80%, and oversampling rates from 10% to 90% are used in combinations to achieve effective detections for the class "Buy", which is the minority in the data set. Random undersampling and five variants of Synthetic Minority Oversampling Techniques (SMOTE): Standard SMOTE, ADASYN, ANS, Borderline SMOTE, and SVM SMOTE, were utilised on the data set. Then, the resulting data sets were crossvalidated and tested with five classifiers: Decision Tree, Logistic Regression, Naïve Bayes, Random Forest and SVM. The result indicated that applying Random Forest with the random undersampling rate of 80% and oversampling rate (ANS) of 80% yielded the best recall in detecting the majority and minority classes overall. 2023 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/5884/1/SE_1903794_FYP_report_%2D_KangShuYi_%2D_SHU_YI_KANG.pdf Kang, Shu Yi (2023) Effective detection of purchasing intention for online shopping. Final Year Project, UTAR. http://eprints.utar.edu.my/5884/
spellingShingle QA76 Computer software
Kang, Shu Yi
Effective detection of purchasing intention for online shopping
title Effective detection of purchasing intention for online shopping
title_full Effective detection of purchasing intention for online shopping
title_fullStr Effective detection of purchasing intention for online shopping
title_full_unstemmed Effective detection of purchasing intention for online shopping
title_short Effective detection of purchasing intention for online shopping
title_sort effective detection of purchasing intention for online shopping
topic QA76 Computer software
url http://eprints.utar.edu.my/5884/
http://eprints.utar.edu.my/5884/1/SE_1903794_FYP_report_%2D_KangShuYi_%2D_SHU_YI_KANG.pdf