Improving Accuracy of Imbalanced Clinical Data Classification Using Synthetic Minority Over-Sampling Technique

Bibliographic Details
Format:	Restricted Document

_version_	1860800044149506048
building	INTELEK Repository
collection	Online Access
collectionurl	https://intelek.unisza.edu.my/intelek/pages/search.php?search=!collection407072
date	2020-08-13 04:29:12
eventvenue	Riyadh; Saudi Arabia
format	Restricted Document
id	8421
institution	UniSZA
originalfilename	1819-01-FH03-FIK-20-39681.pdf
person	like Gecko) Chrome/84.0.4147.125 Safari/537.36 Mozilla/5.0 (Windows NT 6.3; Win64; x64) AppleWebKit/537.36 (KHTML
recordtype	oai_dc
resourceurl	https://intelek.unisza.edu.my/intelek/pages/view.php?ref=8421
spelling	8421 https://intelek.unisza.edu.my/intelek/pages/view.php?ref=8421 https://intelek.unisza.edu.my/intelek/pages/search.php?search=!collection407072 Restricted Document Conference Conference Paper application/pdf 7 1.6 Adobe Acrobat Pro DC 20 Paper Capture Plug-in like Gecko) Chrome/84.0.4147.125 Safari/537.36 Mozilla/5.0 (Windows NT 6.3; Win64; x64) AppleWebKit/537.36 (KHTML 2020-08-13 04:29:12 1819-01-FH03-FIK-20-39681.pdf UniSZA Private Access Improving Accuracy of Imbalanced Clinical Data Classification Using Synthetic Minority Over-Sampling Technique Imbalanced datasets typically occur in many real applications. Resampling is one of the effective solutions due to producing a balanced class distribution. Synthetic Minority Over-sampling technique (SMOTE), an over-sampling technique is used in this study for dealing the imbalanced dataset by add the number of instances of a minority class. This technique is used to decrease the imbalance percentage of the dataset by generating new synthetic samples. Thus, a balanced training dataset is produced to replace the class imbalanced . The balanced datasets were obtained and trained with machine learning algorithms to diagnose the disease’s class. Through the experiment findings on the real-world datasets, oral cancer dataset and erythemato-squamous diseases dataset from the UCI machine learning datasets, an over-sampling method showed better results in clinical disease classification. 1st International Conference on Intelligent Cloud Computing, ICC 2019 Riyadh; Saudi Arabia
spellingShingle	Improving Accuracy of Imbalanced Clinical Data Classification Using Synthetic Minority Over-Sampling Technique
summary	Imbalanced datasets typically occur in many real applications. Resampling is one of the effective solutions due to producing a balanced class distribution. Synthetic Minority Over-sampling technique (SMOTE), an over-sampling technique is used in this study for dealing the imbalanced dataset by add the number of instances of a minority class. This technique is used to decrease the imbalance percentage of the dataset by generating new synthetic samples. Thus, a balanced training dataset is produced to replace the class imbalanced . The balanced datasets were obtained and trained with machine learning algorithms to diagnose the disease’s class. Through the experiment findings on the real-world datasets, oral cancer dataset and erythemato-squamous diseases dataset from the UCI machine learning datasets, an over-sampling method showed better results in clinical disease classification.
title	Improving Accuracy of Imbalanced Clinical Data Classification Using Synthetic Minority Over-Sampling Technique
title_full	Improving Accuracy of Imbalanced Clinical Data Classification Using Synthetic Minority Over-Sampling Technique
title_fullStr	Improving Accuracy of Imbalanced Clinical Data Classification Using Synthetic Minority Over-Sampling Technique
title_full_unstemmed	Improving Accuracy of Imbalanced Clinical Data Classification Using Synthetic Minority Over-Sampling Technique
title_short	Improving Accuracy of Imbalanced Clinical Data Classification Using Synthetic Minority Over-Sampling Technique
title_sort	improving accuracy of imbalanced clinical data classification using synthetic minority over-sampling technique

Improving Accuracy of Imbalanced Clinical Data Classification Using Synthetic Minority Over-Sampling Technique

Similar Items