A comparative study for outlier detection techniques in data mining

Existing studies in data mining mostly focus on finding patterns in large datasets and further using it for organizational decision making. However, finding such exceptions and outliers has not yet received as much attention in the data mining field as some other topics have, such as association rul...

Full description

Bibliographic Details
Main Authors: Bakar, Zuriana Abu, Mohemad, R., Ahmad, A., Mat Deris, Mustafa Mat
Format: Conference or Workshop Item
Language:English
Published: 2006
Subjects:
Online Access:http://eprints.utm.my/7808/
http://eprints.utm.my/7808/1/Mat_Deris_Mustafa_2006_Comparative_Study_Outlier_Detection_Techniques.pdf
_version_ 1848891549474095104
author Bakar, Zuriana Abu
Mohemad, R.
Ahmad, A.
Mat Deris, Mustafa Mat
author_facet Bakar, Zuriana Abu
Mohemad, R.
Ahmad, A.
Mat Deris, Mustafa Mat
author_sort Bakar, Zuriana Abu
building UTeM Institutional Repository
collection Online Access
description Existing studies in data mining mostly focus on finding patterns in large datasets and further using it for organizational decision making. However, finding such exceptions and outliers has not yet received as much attention in the data mining field as some other topics have, such as association rules, classification and clustering. Thus, this paper describes the performance of control chart, linear regression, and Manhattan distance techniques for outlier detection in data mining. Experimental studies show that outlier detection technique using control chart is better than the technique modeled from linear regression because the number of outlier data detected by control chart is smaller than linear regression. Further, experimental studies shows that Manhattan distance technique outperformed compared with the other techniques when the threshold values increased.
first_indexed 2025-11-15T20:59:44Z
format Conference or Workshop Item
id utm-7808
institution Universiti Teknologi Malaysia
institution_category Local University
language English
last_indexed 2025-11-15T20:59:44Z
publishDate 2006
recordtype eprints
repository_type Digital Repository
spelling utm-78082017-08-30T01:32:48Z http://eprints.utm.my/7808/ A comparative study for outlier detection techniques in data mining Bakar, Zuriana Abu Mohemad, R. Ahmad, A. Mat Deris, Mustafa Mat QA75 Electronic computers. Computer science Existing studies in data mining mostly focus on finding patterns in large datasets and further using it for organizational decision making. However, finding such exceptions and outliers has not yet received as much attention in the data mining field as some other topics have, such as association rules, classification and clustering. Thus, this paper describes the performance of control chart, linear regression, and Manhattan distance techniques for outlier detection in data mining. Experimental studies show that outlier detection technique using control chart is better than the technique modeled from linear regression because the number of outlier data detected by control chart is smaller than linear regression. Further, experimental studies shows that Manhattan distance technique outperformed compared with the other techniques when the threshold values increased. 2006 Conference or Workshop Item PeerReviewed application/pdf en http://eprints.utm.my/7808/1/Mat_Deris_Mustafa_2006_Comparative_Study_Outlier_Detection_Techniques.pdf Bakar, Zuriana Abu and Mohemad, R. and Ahmad, A. and Mat Deris, Mustafa Mat (2006) A comparative study for outlier detection techniques in data mining. In: 2006 IEEE Conference on Cybernetics and Intelligent Systems, 7-9 June 2006. http://dx.doi.org/10.1109/ICCIS.2006.252287
spellingShingle QA75 Electronic computers. Computer science
Bakar, Zuriana Abu
Mohemad, R.
Ahmad, A.
Mat Deris, Mustafa Mat
A comparative study for outlier detection techniques in data mining
title A comparative study for outlier detection techniques in data mining
title_full A comparative study for outlier detection techniques in data mining
title_fullStr A comparative study for outlier detection techniques in data mining
title_full_unstemmed A comparative study for outlier detection techniques in data mining
title_short A comparative study for outlier detection techniques in data mining
title_sort comparative study for outlier detection techniques in data mining
topic QA75 Electronic computers. Computer science
url http://eprints.utm.my/7808/
http://eprints.utm.my/7808/
http://eprints.utm.my/7808/1/Mat_Deris_Mustafa_2006_Comparative_Study_Outlier_Detection_Techniques.pdf